AD-A035  754 


UNCLASSIFIED 


LETTERMAN  army  INST  OF  RESEARCH  SAN  FRANCISCO  CALIF  F/G  12/1 

GENERALIZED  RESEARCH  ANALYSIS  STATISTICAL  SYSTEM.  SECOND  EDITIO— FTC (U) 
NOV  76  H LAZARUSf  T MCCAA,  R TEPLICK 


ADA0357 


INSTITUTE  REPORT  33 


GENERALIZED  RESEARCH 
ANALYSIS  STATISTICAL  SYSTEM 

SECOND  EDITION  (DECEMBER  1974) 


HENRY  LAZARUS,  SP5 
TOM  McCAA,  SPS 
RICHARD  TEPLICK,  MAJ 
MARGARET  WRENSCH,  M.S. 


DEPARTMENT  OF  INFORMATION  SCIENCES 

NOVEMBER  1976 


I 


I 

j 


i 

\: 


I 

[ 


REPRODUCTION  OP  THIS  DOCUMENT  IN  WHOLE  OR  IN  PART  IS  PROHIBITED 
EXCEPT  WITH  THE  PERMISSION  OF  LETTEPJIAN  ARMY  INSTITUTE  OF  RESEARCH, 
PRESIDIO  OP  SAN  FRANCISCO,  CALIFORNIA  94129.  HOWEVER,  DDC  IS 
AUTHORIZED  TO  REPRODUCE  THE  DOCUMENT  FOR  UNITED  STATES  GOVERNMENT 
PURPOSES. 

DESTROY  THIS  REPORT  WHEN  NO  LONGER  NEEI«D.  DO  NOT  RETURN  IT  TO  THE 
ORIGINATOR. 

THE  OPINIONS  OR  ASSERTIONS  CONTAINED  HEREIN  ARE  THE  PRIVATE  VIEWS  OF 
THE  AUTHORS  AND  ARE  NOT  TO  BE  CONSTRUED  AS  OFFICIAL  OR  AS  REFLECTING 
THE  VIEWS  OF  THE  DEPARTMENT  OP  THE  ARMY  OR  THE  DEPARTMENT  OF  DEFENSE. 

CTTATION  OF  TRADE  NAMES  IN  THIS  REPORT  DOES  NOT  CONSTITUTE  AN  OFFICIAL 
ENDORSEMENT  OR  APPROVAL  OF  THE  USE  OF  SUCH  ITEMS. 


L 


ww 


J 


UNCLASSIFIED 


Copy  ovaiJable  to  DDC  do««  no( 
Dtmilt  iuliv  legible  lepioductioa 


secumrv  CLASSIFICAT(0N  or  this  pace  (Wh»n  Dim  Bntmrmd) 


m 


r.iCENERALIZED  ^SEARCH  ^ALYSIS 

Statistical  System  . ^ ' 

I ! Second  jdition  (December  i974) ./ 


REPORT  DOCUMENTATION  PAGE 


WCPOWT  NUMBCR 

LAIR— ^33  / 

— — J 


l2.  GOVT  ACCESSION  NO 


4.  Ti  fairt 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


S.  RECIPIENT*S  CATALOG  NUMBER 


S.  Typ^  OF  REPORT  4 PERIOD  COVERED 


^INAL 


REf 


«.  PERFOmtlHG  OPS.  PEPorfTTtUMBER 


'r 


1/ 


Henry/Lazarus 
Rlchar<yTepl  Ic 


i TomAcCaa, 
Margaret^  Wrensch 


S.  CONTRACT  OR  GRANT  NUMBER^*; 


hj  M.i 


•.  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 

Department  of  Information  Sciences  (SGRD-ULI) 
Letterman  Army  Institute  of  Research  / 
Presidio  of  San  Francisco,  CA  94129 


PROGRAM  ELEMENT,  PROJECT.  TASK 
AREA  a MUMMKBt 

Project  C3A7627^ 

Work  Unit  #082 


'mf9i 


It.  CONTROLLING  OFFICE  NAME  AND  ADDRESS 

US  Army  Medical  Research  and  Development 
Command,  Washington,  DC  20314 


14.  monitoring  agency  name  a AODRESSff/ dl//«ran<  from  CanlnlUnt  Ottlem) 


IS.  SECURITY  CLASS,  (of  thim  ropori) 


Unclassified 


is«. 


decl  assification/oowngrading 

SCHEDULE 


14.  DISTRIBUTION  STATEMENT  (of  fhl*  Ropoet) 

APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED 


17.  DISTRIBUTION  STATEMENT  (ml  Uim  mbmitmet  mntmnd  In  BlocA  30,  II  dlllmrmni  Inm  RtpoH) 


IS.  supplementary  NOTES 


19.  KEY  WORDS  (Canllnum  on  tmrmmm  midm  II  nmcmmmmrr  mnd  lOmntllr  Of  bloc*  numbmr} 

Computers 
Statistics 


ABSTRACT  fCMmua  ■■  rmrmrmm  oB*  M immu  aaW  lOiHIT  Sr  aiocA  mmbmm) 

This  report  describes  an  automated  Generalized  Research  Analysis  Statistical 
System  (GRASS)  which  is  easy  to  use  and  which  will  satisfy  many  of  the  statis- 
tical computational  requirements  of  laboratory  investigators.  Instructions 
are  given  for  utilization  of  this  system  in  a manner  sufficiently  detailed 
to  be  usable  by  personnel  unfamiliar  with  computer  operations. 


00/, 


JAM  7> 


1473  EMTION  OF  * NOV  BB  IS  OOSOLETE 


UNCLASSIFIED 


Jr 

M OF  TNIB  PME  (Wtmn  Dim  SnIOM# 


security  classification  of  TNIB  PME  (Mmi 


TABLE  OF  CONTENTS  10/07/T6 


i 


1.  GRASS  1 

1.1  INTRODUCTION  1 

PREPARATION  OF  THE  PUNCHED  CARO  DECK  2 

1.2  PREPARATION  OF  THE  EXPERIMENTAL  DATA  CAROS  3 

1.3  GRASS  CONTROL  CAROS  8 

OVERVIEW  8 

INFORMATION  CONTROL  CAROS  9 

STATISTICAL  ROUTINE  SPECIFICATION  CONTROL  CAROS  11 

DATA  TRANSFORMATION  CAROS  - THE  $TRANS  STATEMENT  13 

1.4  SAMPLE  PROBLEMS  17 

A.  SAMPLE  JOB  DECKS  21 


23 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 


B.  DESCRIPTION  OF  INFORMATION  CONTROL  CAROS 
TITLE 

DESCRIBING  INPUT  DATA 
WRITING  SPECIFIED  VARIABLES 
COOED  FILES 
BINARY  FILES 
FORMAT  STATEMENT 
FORMFREE  INPUT  DATA 
COMMENT  CARD 
REWIND 

HISTOGRAM  SCALING 
PLOT  SCALING 

COMBINING  VARIABLES  - $A00 


END 


35 


TABLE  QF  CONTENTS  10/07/76 


ii 

C.  STATISTICAL  ROUTINE  SPECIFICATION  CAROS  36 

BASIC  STATISTICS  36 

NON-PAIREO  T-TEST  38 

PAIRED  T-TEST  AO 

WILCOXON»S  RANK  SUM  TEST  A2 

WILCOXONTS  SIGNED  RANKS  TEST  AS 

ONE  WAY  ANALYSIS  OF  VARIANCE  A8 

OUNCANtS  NEW  MULTIPLE  RANGE  TEST  51 

NEWMAN  - KEULtS  MULTIPLE  RANGE  TEST  54 

THE  KRUSKAL-WALLIS  STATISTIC  56 

MULTIPLE  COMPARISONS  BASED  ON  THE  KRUSKAL-WALLIS  H 59 
STATISTIC 

PEARSONtS  CORRELATION  61 

KENOALLtS  RANK  CORRELATION  6A 

CHI-SQUARE  67 

LEAST  SQUARES  REGRESSION  70 

MULTIPLE  REGRESSION  73 

STEP-WISE  MULTIPLE  REGRESSION  76 

POLYNOMIAL  REGRESSION  78 

NORMALITY  TEST  80 

HISTOGRAM  83 

PLOT  ROUTINE  8A 

PLOT  NORMAL  85 

PLOT  INVERTED  * 86 

F-OISTRIBUTION  87 

CHI-SQUARE  DISTRIBUTION  88 

T-OISTRIBUTION  89 


WiLCGXONfS  RANK  SUM  AND  SIGNED  RANKS  0ISTRI8UTI0NS 


90 


0.  FIGURES 


TABLE  OF  CONTENTS  10/07/76 


1 THRU  57 


2 


DISTOIBOTION  LIST 


1.1 


GRASS 

INTRODUCTION 


1 


1.1  INTRODUCTION 

NEARLY  EVERY  INVESTIGATOR  HAS  A NEED*  AT  ONE  TINE  OR  ANOTHERt  FOR 
STATISTICAL  COMPUTATIONS.  HE  HAY  USE  A SHALL  DESK  CALCULATOR  TO 
COMPUTE  MEANS  AND  STANDARD  DEVIATIONS  FOR  EACH  OF  HIS  VARIABLES. 
HOWEVEAt  AS  THE  COMPLEXITY  OF  THE  ANALYSES  INCREASES  OR  AS  THE  NUMBER 
OF  VARIABLES  AND/OR  THE  NUMBER  OF  OBSERVATIONS  INCREASESt  MANY  HOURS 
HAVE  TO  BE  SPENT  BEHIND  THE  CALCULATOR.  THIS  INCREASES  THE  CHANCE  FOR 
HUMAN  ERRORS  IN  THE  CALCULATIONS.  IN  ADDITION*  IT  DISTRACTS  THE 
RESEARCHER  AND/OR  HIS  TECHNICIANS  FROM  LABORATORY  WORK  AND  PERHAPS 
FROM  A MORE  COMPREHENSIVE  ANALYSIS  OF  THE  DATA.  AN  EASY  TO  USE 
AUTOMATED  SYSTEM  WOULD  BE  OF  GREAT  BENEFIT  TO  RESEARCHERS*  PROVIDED  IT 
WAS  IN  PRACTICE  EASY  TO  USE.  THIS  REQUIREMENT  IS  ONE  OF  THE  MOST 
IMPORTANT  SPECIFICATIONS  OF  SUCH  A SYSTEM.  SINCE  MOST  BIOMEDICAL 
RESEARCHERS  ARE  NOT  READILY  CONVERSANT  WITH  COMPUTER  SYSTEMS*  NOR  DO 
THEY  GENERALLY  HAVE  THE  TIME  TO  DEVOTE  TOWARDS  BECOMING  SO.  IN 
THEORY*  ALL  A RESEARCHER  SHOULD  HAVE  TO  DO  IS  TO  WRITE  HIS  DATA  ON  A 
STANDARD  FORM  IN  AN  ORGANIZED  MANNER*  SPECIFY  THE  STATISTICS  HE 
DESIRES.  AND  SUBMIT  HIS  PROBLEM  TO  A CENTRAL  FACILITY.  ALL  HIS 
COMPUTATIONS  WOULD  THEN  BE  HANDLED  BY  AN  AUTOMATED  SYSTEM. 

A GENERALIZED  RESEARCH  ANALYSIS  STATISTICAL  SYSTEM,  GRASS*  HAS 

BEEN  WRITTEN  BY  THE  DEPARTMENT  OF  INFORMATION  SCIENCES  TO  PROVIDE  THIS 
SUPPORT  TO  LABORATORY  INVESTIGATORS.  GRASS  HAS  THE  CAPABILITY  TO 
COMPUTE  A VARIETY  OF  DESCRIPTIVE  STATISTICS*  TO  PERFORM  MANY 
PARAMETRIC  AND  NON-PARAMETR IC  STATISTICAL  TESTS*  AND  TO  PRODUCE  PLOTS 
AND  HISTOGRAMS.  THE  DESCRIPTIVE  STATISTICS  INCLUDE  MEANS*  MEDIANS* 
STANDARD  DEVIATIONS,  STANDARD  ERRORS  OF  THE  MEAN*  MINIMA*  MAXIMA* 
RANGES*  PEARSON  CORRELATION  COEFFICIENTS  AND  KENOALLtS  RANK 
CORRELATION  COEFFICIENTS.  THE  STATISTICAL  TESTS  INCLUDE  PAIRED  AND 
NON-PAIRED  T- TESTS*  WILCOXONfS  RANK  SUM  AND  SIGNED  RANKS  TESTS* 

GENERAL  ONE-WAY  ANALYSIS  OF  VARIANCE,  OUNCANtS  NEW  MULTIPLE  RANGE 
TEST*  NEWMAN-KEULtS  RANGE  TEST,  KfiUSKAL-WALL  IS  TEST*  MULTIPLE 
COMPARISONS  BASED  ON  THE  KRUSKAL-WALLI S STATISTIC,  CHI-SQUARE  TESTS* 
TESTS  OF  SIGNIFICANCE  OF  KENOALLtS  CORRELATION  COEFFICIENT*  TESTS  OF 
SIGNIFICANCE  OF  SIMPLE  LINEAR*  MULTIPLE*  STEP-WISE*  AND/OR  POLYNOMIAL 
REGRESSIONS*  AND  NORMALITY  TESTS. 

GRASS  ALSO  HAS  A VARIETY  OF  DATA  TRANSFORMATION  AND  DATA 
MANIPULATION  CAPABILITIES  (I.E.*  NEW  VARIABLES  CAN  BE  CREATED  AS 
FUNCTIONS  OF  EXISTING  VARIABLES  OR  EXISTING  VARIABLES  CAN  BE 
REWRITTEN). 

ALL  THAT  IS  REQUIRED  OF  THE  INVESTIGATOR  IS  TO  PREPARE  A PUNCHED 
CARO  DECK  WHICH  CONSISTS  OF  THE  EXPERIMENTAL  DATA  TO  BE  ANALYZED  AND  A 
LIST  OF  THE  ANALYSES  TO  BE  PERFORMED.  FOR  USER  CONVENIENCE*  GRASS 
WILL  ACCEPT  THE  EXPERIMENTAL  DATA  FROM  INPUT  SOURCES  OTHER  THAN  CAROS. 
GRASS  ALSO  ALLOWS  THE  USER  TO  WAITE  VARIABLES  OF  HIS  CHOICE  TO  A 
VARIETY  OF  OUTPUT  STORAGE  FACILITIES. 
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1.  I 


PREPARATION  OP  THE  PUNCHED  CARO  DECK 


PREPARATION  OF  THE  EXPERIMENTAL  DATA  FOR  ANALYSIS  BY  THE  GRASS 
PROGRAM  REQUIRES  THAT  A DECK  OF  PUNCHED  CARDS  BE  ASSEMBLED.  A PUNCHED 
CARO  IS  A SPECIAL  PURPOSE  CARO  WHICH  CAN  BE  READ  BY  A DIGITAL  COMPUTER 
SYSTEM.  IT  HAS  60  SEQUENTIALLY  NUMBERED  CONTIGUOUS  COLUMNS.  DATA 
with  one  CHARACTER  PER  COLUMN  IS  PUNCHED  ONTO  THESE  CAROS.  INPUT  DATA 
ARE  PUNCHED  EITHER  IN  FIXED  OR  FORMAT-FREE  DATA  FIELDS.  A DATA  FIELD 
IS  ONE  OR  MORE  CONTIGUOUS  COLUMNS  UN  A SINGLE  CARO  WHICH  ARE  ASSIGNED 
TO  A VARIABLE. 

THE  GRASS  DECK  IS  ASSEMBLED  USING  THREE  TYPES  OF  CAROS  - SYSTEM 
CONTROL  CAROSi  GRASS  CONTROL  CAROSt  AND  EXPERIMENTAL  DATA  CAROS. 

THESE  CARDS  MUST  Bt  IN  A SPECIFIC  SEQUENCE  FOR  THE  GRASS  PROGRAM  TO 
WORK  PROPERLY.  THE  SYSTEM  CONTROL  CAROS  ARE  REQUIRED  BY  ALL  JOBS* 

THE  SYSTEM  CONTROL  CAROS  FOR  THE  COC  7600  BKV  OPERATING  SYSTEMt 
(MODIFIED  SCOPE).  NECESSARY  FOR  PROCESSING  A GRASS  RUN  WHEN 
EXPERIMENTAL  DATA  IS  INPUT  VIA  CAROS  ARE  LISTED  IN  APPENDIX  A. 

APPENDIX  A ALSO  CONTAINS  EXAMPLES  OF  THE  SYSTEM  CAROS  TO  USE  WHEN  THE 
INPUT  DEVICE  IS  OTHER  THAN  CAROS.  SINCE  THE  EXAMPLES  GIVEN  APPLY  ONLY 
TO  THE  COC  7600  BRV  OPERATING  SYSTEM.  THE  USER  SHOULD  CONSULT  A LOCAL 
COMPUTER  SPECIALIST  CONCERNING  THE  APPROPRIATE  SYSTEM  CAROS  TO  USE  FOR 
A PARTICULAR  MACHINE. 


1.2  GRASS  3 

PREPARATION  OF  THE  EXPERIMENTAL  DATA  CAROS 


1.2  PREPARATION  OF  THE  EXPERIMENTAL  DATA  CAROS 

THE  FINAL  FORM  OF  THE  INPUT  DATA  MAY  BE  REPRESENTED  SCHEMATICALLY 

AS 

VARIABLES 


xm 

XI2) 

XI3I....I 

1 

• 

Xll 

X12 

X13 

2 

• 

• 

X21 

X22 



3 

• 

* 

X31 

X32 

X33 

• 

• 

• 

• 

• 

• 

• 

* 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

* 

• 

• 

• 

• 

M 

• 

XMl 

XN2 

XM3...... 

MHERE  Xll  THROUGH  XIN  ARE  ALL  THE  VALUES  FOR  THE  FIRST  CASEt  , 

( SUBJECT).  AND  THERE  ARE  M CASES.  A CASE  MAY  BE  PUNCHED  ON  AS  MANY 

CAROS  AS  NECESSARY,  BUT  FOR  FIXED  FORMAT  DATA  A VALUE  FOR  A PARTICULAR 

VARIABLE  MUST  BE  IN  THE  SAME  LOCATION  FROM  CASE  TO  CASE.  THAT  IS  IF  | 

EACH  CASE  REQUIRED  2 CAROS  AND  THE  FIRST  CASEtS  WEIGHT  WAS  IN  COLUMNS  j 

l-A  OF  THE  first  CARO  AND  THE  FIRST  CASE«S  HEIGHT  WAS  IN  COLUMNS  10-13  ! 

OF  THE  SECOND  CARO,  ALL  SUBSEQUENT  CASES  MUST  HAVE  WEIGHT  PUNCHED  IN  I 

COLUMNS  1-4  OF  A CASEtS  FIRST  CARO  AND  HEIGHT  PUNCHED  IN  COLUMNS  10-13 

OF  A CASEtS  SECOND  CARO.  FOR  FORMAT-FREE  INPUT  DATA,  VARIABLES  MUST 

BE  IN  THE  SAME  ORDER  FROM  CASE  TO  CASE.  | 

THE  REST  OF  THE  DISCUSSION  IN  THIS  SECTION  APPLIES  TO  DATA  SET  UP  ; 

IN  FIXED  FORMAT.  FOR  SETTING  UP  FORMAT  FREE  DATA  SEE  THE  DISCUSSION  I 

OF  THE  AFMFREE  CARO  IN  APPENDIX  B.  - 

TO  PREPARE  THE  EXPERIMENTAL  DATA  CARDS  IN  THE  GENERAL  FORM  ABOVE,  | 

SIX  STEPS  ARE  NECESSARY.  THESE  STEPS  MUST  BE  FOLLOWED  CAREFULLY  IF  ' 

THE  DESIRED  RESULTS  ARE  TO  BE  ATTAINED.  THE  STEPS  ARE?  ] 

1.  SEQUENTIALLY  NUMBER  EACH  VARIABLE.  | 

2.  DETERMINE  THE  MAXIMUM  FIELD  WIDTH,  NUMBER  OF  CHARACTERS  OF 
EACH  VARIABLE.  THIS  WILL  BE  THE  MINIMUM  DATA  FIELD 
SPECIFICATION. 

3.  LAYOUT  THE  DATA  FIELDS  FOR  EACH  VARIABLE  ON  STANDARD  80 
COLUMN  DATA  SHEETS. 

i 

4.  DETERMINE  IF  THE  DECIMAL  POINT  IS  TO  BE  PUNCHED  OR  LOCATE  | 

THE  POSITION  OF  THE  DECIMAL  IN  THE  DATA  FIELD  IF  THE 

DECIMAL  POSITION  IS  TO  BE  IMPLIED  THROUGH  THE  FORMAT 
STATEMENT. 

5.  TABULATE  THE  RAW  DATA  IN  THE  APPROPRIATE  DATA  FIELD  ON  THE 
80  COLUMN  DATA  SHEETS. 
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6.  OESCRIBF  THE  LAYOUT  OF  THE  DATA  FIELDS  IN  A FORMAT 

statement. 

1.  SEQUENTIALLY  NUMBER  EACH  VARIABLE. 

EACH  OF  THE  VARIABLES  IN  THE  STUDY  MUST  BE  NUMBERED  SEQUENTIALLY. 
THE  VARIABLE  NUMBER  WILL  BE  USED  TO  IDENTIFY  THE  VARIABLE  IN  THE  GRASS 
PROGRAM. 

2.  DETERMINE  THE  MINIMUM  FIELD  MIOTH  FOR  EACH  VARIABLE. 

THE  total  NUMBER  OF  CHARACTERS  NECESSARY  FOR  A NUMERICAL 
OBSERVATION  INCLUDES  THE  DIGITS,  THE  DECIMAL  POINT,  THE  ALGEBRAIC 
SIGN,  AND  WHEN  NECESSARY  AN  INDICATOR  WHICH  SPECIFIES  THE  EXPONENT  OF 
BASE  TEN  BY  WHICH  THE  DECIMAL  DIGITS  ARE  TO  BE  MULTIPLIED.  IF  NO  SIGN 
IS  PUNCHED  THE  SYSTEM  WILL  ASSUME  THE  NUMBER  IS  POSITIVE.  THE  DECIMAL 
POINT  MAY  BE  IMPLIED  THROUGH  THE  USE  OF  A FORMAT  STATEMENT,  AS  WILL  BE 
OcSCRIBtD  LATER.  THE  MINIMUM  FIELD  WIDTH  MUST  ALLOW  FOR  ALL  CHARACTERS 
WHICH  ARE  NEEDED  TO  DESCRIBE  A VARIABLE  FOR  ALL  THE  CASES.  A FIELD 
WIDTH  GREATER  THAN  THE  MINIMUM  .MAY  BE  USED.  UNUSED  CHARACTERS  MUST  BE 
BLANKS  OR  ZEROES.  USUALLY  BLANKS  PRECEDE  THE  FIRST  CHARACTER  OF  A 
NUMBER  AND  ZEROES  ARE  USED  AFTER  THE  FIRST  CHARACTER  HAS  BEEN 
INDICATED.  AN  ENTIRE  FIELD  WHICH  IS  LEFT  BLANK  IS  CONSIDERED  A 
MISSING  DATA  POINT  BY  THE  GRASS  SYSTEM.  A FIELD  IN  WHICH  THE  VALUE  OF 
A VARIABLE  IS  ZERO  MUST  HAVE  A ZERO  PUNCHED  IN  IT. 

3.  LAYOUT  THE  DATA  FIELD  FOR  EACH  VARIABLE  ON  80  COLUMN  DATA  SHEETS. 

LAIR  USERS  MAY  OBTAIN  STANDARD  80  COLUMN  DATA  SHEETS  FROM  THE 
DEPARTMENT  OF  INFORMATION  SCIENCES.  OTHER  USERS  SHOULD  CONTACT  THEIR 
LOCAL  COMPUTER  FACILITY  TO  OBTAIN  THESE  FORMS.  EACH  COLUMN  ON  A SHEET 
CORRESPONDS  TO  A COLUMN  ON  AN  80  COLUMN  HOLLERITH  CARO.  A KEYPUNCH 
OPERATOR  WORKS  DIRECTLY  FROM  THE  COOING  SHEETS  IN  PREPARING  THE 
PUNCHED  card  deck.  THE  DATA  SHEET  IS  DIVIDED  INTO  DATA  FIELDS  WITH 
ONE  FIELD  FOR  EACH  VARIABLE.  DATA  FIELDS  MAY  BE  SEPARATED  BY  BLANK 
FIELDS.  FOR  CLARIFICATION,  DATA  FIELDS  MAY  BE  EMPHASIZED  BY  DRAWING 
HEAVY  VERTICAL  LINES  ON  THE  SHEETS  TO  SEPARATE  THE  FIELDS.  ENOUGH 
SHEETS  ARE  LAID  OUT  SO  THAT  EACH  VARIABLE  FOR  A SINGLE  CASE  IS 
ASSIGNED  AN  ADEQUATELY  SIZED  DATA  FIELD. 

4.  DETERMINE  IF  THE  DECIMAL  POINT  IS  TO  BE  PUNCHED  OR  LOCATE  THE 
POSITION  OF  THE  DECIMAL  IN  THE  DATA  FIELD  IF  IT  IS  IMPLIED  IN 
THE  FORMAT  STATEMENT. 

PUNCHING  OF  A DECIMAL  POINT  IS  OPTIONAL  BECAUSE  A FORMAT 
STATEMENT  DESCRIBES  EACH  DATA  FIELD  IN  DETAIL  INCLUDING  THE  LOCATION 
OF  THE  DECIMAL  POINT.  THE  NUMBER  3.1416  COULD  BE  ENTERED  ON  THE  DATA 
SHEET  AS  31416  AND  COULD  BE  READ  BY  THE  GRASS  PROGRAM  AS  3.1416  IF  SO 
SPECIFIED  IN  THE  FORMAT  STATEMENT.  WHEN  DECIMAL  POINTS  ARE  NOT 
PUrCHEO,  DATA  MUST  BE  ALIGNED  IN  THE  DATA  FEILD  SO  THAT  THE  FIRST 
DIGIT  FOLLOWING  THE  IMPLIED  DECIMAL  POSITION  IS  ALWAYS  ENTERED  IN  THE 
SAME  COLUMN  OF  THE  FIELD. 
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5.  TABULATE  THE  RAM  DATA  IN  APPROPRIATE  FIELDS. 

ENTER  ALL  DATA  FOR  ONE  CASE  ON  A HORIZONTAL  LINE  ON  ONE  OR  MORE 
DATA  SHEETS  WHICH  HAVE  dEEN  PREPARED.  SUCCESSIVE  CASES  ARE  ENTERED  ON 
SUBSEQUENT  LINES  IN  TABULAR  FASHION.  WHEN  A SINGLE  SET  OF  DATA  SHEETS 
ARE  EXHAUSTED,  NEW  SETS  OF  SHEETS  ARE  USED  WITH  THE  EXACT  FIELD 
SPECIFICATIONS  OF  THE  FIRST  SET. 

6.  DESCRIBE  THE  LAYOUT  OF  THE  DATA  FIELDS  IN  A FORMAT  STATEMENT 

THE  FORMAT  STATEMENT  IS  A DESCRIPTION  OF  THE  WAY  IN  WHICH  THE 

DATA  SHEETS  ARE  LAID  OUT  AND  CONSEQUENTLY,  THE  WAY  IN  WHICH  THE  DATA 

WILL  APPEAR  ON  PUNCHED  CAROS.  THE  GRASS  FORMAT  STATEMENT  BEGINS  WITH 
A DOLLAR  SIGN,  B,  IN  COLUMN  1,  FOLLOWED  BY  THE  WORD  FORNAT  IN  COLUMNS 
2-7.  following  this  IS  A STANDARD  FORTRAN  IV  FORMAT  STATEMENT.  THE 

SPECIFICATIONS  WHICH  MAY  BE  USED  ARE  F,  E,  ANO/OR  X.  NUMERICAL  DATA 

MAY  BE  OF  EITHER  F OR  E TYPE.  F OR  E TYPE  DATA  ARE  READ  USING  F OR  E 
TYPE  SPcCIF [CATIONS  RESPECTIVELY.  BOTH  TYPES  OF  DATA  CONSIST  OF  ONE 
OR  MORE  DIGITS  WITH  A SINGLE  DECIMAL  PCINT  AND  A SIGN.  AN  E TYPE 
NUMBER  DIFFERS  FROM  AN  F TYPE  NUMBER  IN  THAT  IT  ALSO  HAS  AN  EXPONENT 
FOLLOWING  THE  DIGITS.  THE  EXPONENT  IS  THE  LETTER  -E-  FOLLOWED  BY  AN 
OPTIONAL  SIGN  POSITION  FOLLOWED  BY  ONE  OR  TWO  DIGITS.  THE  MEANING  OF 
THE  EXPONENT  IS  THAT  THE  PRECEDING  NUMBER  IS  TO  BE  MULTIPLIED  BY  TEN 
RAISED  TO  THE  POWER  GIVEN  BY  THE  SIGN  AND  THE  FOLLOWING  OIGI«S,  (I.E. 
2.5E10»2.5  TIMES  10  TO  THE  IOTH  POWERI.  SOME  EXAMPLES  OF  F AND  E TYPE 
NUMBERS  AND  THE  NUMERICAL  VALUES  THEY  REPRESENT  ARE 


NUMERIC 

VALUE 

E TYPE 

F TYPE 

200 

2.00E02 

200.0 

-10.5 

-1.05E01 

-10.5 

.0033 

3.3E-03 

.0033 

THE  GENERAL  FORM  OF  AN  F OR  E TYPE  SPECIFICATION  IS  NCW.O  WHERE 
N - NUMBER  JF  SUCCESSIVE  REPETITIONS  OF  THIS  FIELD 
C - TYPE  OF  SPECIFICATION,  F OR  E 
W - AN  INTEGER  SPECIFYING  THE  WIDTH  OF  THE  FIELD 

INCLUDING  DECIMAL  POINT  (IF  ANYI  AND  SIGN  ( LF  ANYI 
D - NUMBER  OF  DECIMAL  PLACES  UNDERSTOOD  TO  BE  IN  THE  FIELD. 

SOME  examples  ARE 


NUMERIC 

VALUE 

INPUT 

DATA 

SPECIFICATION 

22.5 

22.5 

F4.  1 

999.9 

9999 

F4.1 

99.99 

9999 

F4.2 

9.999 

9999 

F4.3 

-.00001 

-l.OE-05 

E8.1 

155.7 

1.557E02 

E8.3 

1000 

10.00E02 

E6.2 

IT  CAN  BE  SEEN  IN  THE  ABOVE  THAT  -W-,  THE  WIDTH  OF  THE  FIELD  INCLUDES 
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THE  DECIMAL  POINT  AND  SIGN  IF  THESE  ARE  PRESENT.  FURTHER  BY  CHOOSING 
AN  APPROPRIATE  VALUE  FOR  -0>»  THE  DECIMAL  POINT  CAN  BE  IMPLIED  WHEN 
USING  F TYPE  NUMBERS.  A DECIMAL  POINT  IS  USUALLY  PUNCHED  WHEN  THE  RAW 
DATA  IS  OF  THE  E TYPE.  A USEFUL  RULE  TO  REMEMBER  IS  THAT  A PUNCHED 
decimal  POINT  OVERRIDES  THE  -0-  CHARACTER  SPECIFIED  IN  THE  FORMAT. 

THUS  IF  THE  NUMBER  22.5  IS  PUNCHED.  ANY  OF  THE  FORMATS  F4.4.  F4.3t 
F4.2t  F4.1.  OR  F4.0  WOULD  CAUSE  THE  VALUE  22.5  TO  BE  READ. 

THE  X SPECIFICATION  IS  USED  TO  SKIP  OVER  BLANKS  OR  NUMERIC  DATA. 
THE  GENERAL  FORM  OF  THE  X SPECIFICATION  IS  NX  WHERE  N REPRESENTS  THE 
number  OF  COLUMNS  TO  BE  SKIPPED.  THUS  lOX  WOULD  CAUSE  10  COLUMNS  TO 
BE  SKIPPED. 

ELEMENTS  OF  A FORMAT  STATEMENT  ARE  SEPARATED  BY  COMMAS.  IF  THE 
VARIABLES  FOR  A CASE  ARE  PUNCHED  ON  MORE  THAN  ONE  CARO.  A SLASH.  I/). 
INDICATES  THAT  A NEW  CARO  IS  TO  BE  READ.  SINCE  THE  FORMATS  FOR  ALL 
CASES  ARE  IDENTICAL.  ONLY  ONE  SFQRMAT  CARO  IS  USED.  THE  GRASS  PROGRAM 
HILL  READ  AS  MANY  CASES  AS  ARE  SPECIFIED  ON  THE  SOATA  CARO. 

EXAMPLES  OF  GRASS  FORMAT  STATEMENTS 

1.  IN  A HYPOTHETICAL  STUDY.  4 VARIABLES  ARE  MEASURED.  ' THE  VALUES 

FOR  THE  FIRST  CASE  ARE  3S.5.  T99.  81000.  AND  2.  THESE  MIGHT  BE 
PUNCHED  ON  CARDS  AS 

11111111122  8 
COLUMN  12345678901234567890-^0 

VALUES  355  79981000  2 


USING  THE  F AND  X SPECIFICATIONS  THIS  DATA  CCULO  BE  READ  USING  THE 
FORMAT  STATEMENT 

SFORMATf2X.F3.1«2X»F3.0fF5.0t4XfF1.0l. 

IF  WE  CONSIDER  HOWEVER  THAT  ALL  NUMBERS  ARE  IN  FIELDS  OF  5.  (THE  FIRST 
AND  SECOND  NUMBER  HAVE  2 LEADING  BLANKS.  AND  THE  FOURTH  NUMBER  HAS  4 
LEADING  8LANKSI*  WE  SEE  THAT  THE  INPUT  DATA  COULD  BE  READ  WITH  THE 
FORMAT  STATEMENT 

$F0RMATIF5.1»F5.0*F5.0»F5.0I. 

THIS  FORMAT  STATEMENT  COULD  BE  FURTHER  CONTRACTED.  NOTE  THAT  THE  LAST 
THREE  ELEMENTS  ARE  F5.0.  THESE  CAN  BE  COMBINED  AS  3F5.0  (THREE 
SUCCESIVE  FIELDS  IN  Fi.D  FORMAT  ARE  TO  BE  READ!.  THUS  THE  FORMAT 
STATEMENT  COULD  BE  REWRITTEN  AS 

$F0RNAT(F5.1,3F5.0I. 

THIS  IS  A MUCH  SHORTER  FORM  THAN  THE  FIRST  FORMAT  STATEMENT  BUT 
CONTAINS  THE  SAME  INFORMATION  AND  WILL  RESULT  IN  THE  SAME  NUMBERS. 
35.5.  799.  81000  AND  2 BEING  READ. 


2.  A HYPOTHETICAL  STUDY  INVOLVES  THE  MEASUREMENT  OF  5 VARIABLES. 

THE  VALUES  FOR  THE  FIRST  CASE  ARE  .211.  .333.  2100.  57.8.  AND  100. 


I 

I 

L 


1.2  GRASS 

RREPARATICN  QF  THE  EXPERIMENTAL  DATA  CAROS 


THESE  VALUES  MIGHT  BE  ARRANGED  AND  PUNCHED  AS 

1111111111222222 
COLUMN  1234567890123456789012345- 

• 211  .333  2100  578  100 

THE  READER  SHOULD  CONVINCE  HIMSELF  THAT  ANY  OF  THE  FOLLOWING 
STATEMENTS  WILL  RESULT  IN  THE  VALUES  •211f  .333»  2100*  57.8> 
BEING  READ. 

SF0RNATIlXtF4.3flX«F4.3*lX«F4.0t2X,F3.1t2XtF3.01 

$F0RNAT«2ClX.F4.3ltlXtF4.0*2X.F3.1t2XtF3.01 

$F0RMATfF5.3»F5.3tF5.O.F5.1.F5.0l 

$F0RMATC2F5.3tF5.0«F5.1tF5.0) 

SF0RNAT|F5.0*F5.0«F5.0tF5.1»F5.0) 

iF0RMATI3F5.0*F5.1tF5.0l 


8 
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FORMAT 
AND  100 
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1.3  GRASS  CONTROL  CAROS 


OVERVIEW 


THERE  ARE  THREE  TYPES  OF  GRASS  CONTROL  CAROS  - INFORMATION 
CONTROL  CAROSt  STATISTICAL  ROUTINE  SPECIFICATION  CAROSt  ANO  OATA 
transformation  CAROS.  THE  INFORMATION  CONTROL  CAROS  SPECIFY  GENERAL 
INFORMATION  TO  THE  GRASS  PROGRAM,  SUCH  AS  THE  NUMBER  OF  VARIABLES  ANO 
OBSERVATIONS  ANO  THE  FORMAT  OF  THE  INPUT  OATA.  THE  STATISTICAL 
ROUTINE  SPECIFICATION  CARDS  SPECIFY  WHICH  STATISTICAL  ROUTINES  ARE  TO 
BE  USED  IN  THE  ANALYSES.  THE  DATA  TRANSFORMATION  CAROS  SPECIFY  THE 
TYPES  OF  OATA  TRANSFORMATIONS  ANO  MANIPULATIONS  THE  USER  DESIRES  TO 
PERFORM  ON  THE  OATA. 

ALL  GRASS  CONTROL  CAROS  HAVE  A DOLLAR  SIGN,  $,  PUNCHED  IN  COLUMN 
1,  FOLLOWED  BY  A GRASS  CONTROL  WORD  PUNCHED  IN  COLUMNS  2 THROUGH  7, 
FOLLOWED  BY  A PARAMETER  FIELD  PUNCHED  IN  COLUMNS  8 THROUGH  79. 

•MOTE  THE  FOLLOWING  IN  THE  PREPARATION  OF  INFORMATION  CONTROL  CARDS 
ANO  STATISTICAL  ROUTINE  SPECIFICATION  CAROS- 

II  IF  THE  PARAMETER  FIELD  HAS  TO  BE  CONTINUED  ON  SUBSEQUENT  CAROS,  THE 
LETTER  -C-  MUST  BE  PUNCHED  IN  COLUMN  80  OF  THE  CURRENT  CARO. 

21  WHEN  A CONTINUATION  CARD  IS  NECESSARY  THE  CARO  LABEL,  CONTROL  WORD, 
MUST  NOT  BE  INCLUDED  ON  THE  CONTINUATION  CARO. 

31  THE  PARAMETER  FIELDS  OF  CERTAIN  CAROS  MAY  NOT  BE  CONTINUED  ONTO 
SUBSEQUENT  CARDS.  THIS  LIMITATION  IS  NOTED  FOR  AFFECTED  CONTROL  CAROS 
IN  APPENDICES  B ANC  C. 
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INFORMATION  CONTROL  CAROS 


) THE  FOLLOWING  IS  AN  INDEX  OF  GRASS  INFORMATION  CONTROL  CAROS. 

THESE  CAROS  ARE  DESCRIBED  IN  DETAIL  IN  APPENDIX  B. 

ATITLE  A TITLE  OF  THE  USERfS  CHOICE  IS  PUNCHED  IN  THE  PARAMETER 

FIELD.  THIS  TITLE  APPEARS  AT  THE  TOP  OF  EACH  PAGE  OF 
OUTPUT*  UNTIL  ANOTHER  STITLE  CARO  IS  USED. 

tOATA  THE  PARAMETER  FIELD  INCLUDES  THE  NUMBER  OF  VARIABLES* 

NUMBER  OF  OBSERVATIONS*  AND  AN  INDICATION  OF  WHETHER  THE 
INPUT  DATA  IS  TO  BE  PRINTED  IN  THE  OUTPUT, 

SFORNAT  THE  PARAMETER  FIELD  IS  A FORTRAN  FORMAT  STATEMENT  IN  WHICH 

F*  E*  OR  X TYPE  SPECIFICATIONS  MAY  BE  USED  TO  READ  INPUT 
DATA  WHICH  ARE  PUNCHED  OR  WRITTEN  IN  FIXED  FORMAT. 

iFNFREE  THERE  IS  NO  PARAMETER  FIELD.  DATA  PUNCHED  IN  A 

FORMAT-FREE  MANNER*  (EACH  VALUE  HAS  THE  DECIMAL  POINT 
PUNCHEC  AND  VALUES  ARE  SEPARATED  BY  BLANK  OR  COMMAS  I*  WILL 
BE  READ. 

$MRITE  THE  PARAMETER  FIELD  INCLUDES  A SPECIFICATION  OF  WHICH 

VARIABLES  ARE  TO  BE  WRITTEN. 

SFILE  THE  PARAMETER  FIELD  IS  A 1 THROUGH  7 CHARACTER  NAME.  IF 

IFILE  IS  USED  AFTER  THE  iOATA  CARO*  THE  FORMATTED  INPUT 
DATA  WILL  BE  READ  FROM  THE  FILE  NAMED  IN  THE  PARAMETER 
FIELD.  IF  THE  $FILE  CARO  IS  USED  AFTER  THE  $WRITE  CARO* 
THE  VARIABLES  SPECIFIED  ON  THE  iWRITE  CARO  WILL  BE  WRITTEN 
TO  THE  FILE  NAMED  OR  TO  BCOOWT  IF  THE  PARAMETER  FIELD  ON 
THE  SFILE  CARO  IS  LEFT  BLANK.  A SFORMAT  CARO  MUST  FOLLOW 
THE  SFILE  CARD.  THE  SFILE  CARO  IS  NCT  USED  WHEN  INPUT 
DATA  IS  ON  CAROS. 

tFILEB  THE  PARAMETER  FIELD  INCLUDES  A 1 THROUGH  7 CHARACTER  NAME. 

IF  USED  AFTER  THE  SOATA  CARO*  THE  NON-FORMATTED  INPUT  DATA 
WILL  BE  READ  FROM  THE  FILE  NAMED.  IF  USED  AFTER  THE 
SWRITE  CARO*  THE  VARIABLES  SPECIFIED  ON  THE  SWRITE  CARO 
WILL  BE  WRITTEN  IN  NON-FORMATTED*  (BINARYI*  FORM  TO  THE 
file  named.  no  SFORMAT  CARO  SHOULD  FOLLOW  THE  SFILEB 
CARO.  THE  SFILEB  CARD  IS  NOT  USED  WHEN  THE  INPUT  DATA  IS 
ON  CARDS  IN  NON-FORMATTEO  FORM, 

$CMT  THE  PARAMETER  FIELD  INCLUDES  ANY  COMMENT  THE  USER  WISHES 

TO  MAKE.  THE  COMMENT! SI  WILL  BE  PRINTED  IN  THE  OUTPUT. 

•RMO  THE  PARAMETER  FIELD  INCLUDES  A I THROUGH  7 CHARACTER  NAME. 

THE  FILE  NAMED  WILL  BE  REWOUND. 

SSCALE  THE  PARAMETER  FIELD  SPECIFIES  THE  SCALE  TO  BE  USED  IN 

SETTING  UP  THE  AXES  FOR  A PLOT  OR  HISTOGRAM, 

tAOO  THE  PARAMETER  FIELD  SPECIFIES  TWO  VARIABLES  WHICH  ARE  TO 

BE  COMBINED  AND  A THIRD  VARIABLE  WHICH  WILL  CONTAIN  THE 
COMBINED  VARIABLES. 


1.3 


GRASS 

GRASS  CONTROL  CARDS 


10 


$ENO  THIS  IS  THE  LAST  GRASS  CONTROL  CARO  OF  EVERY  DECK. 

ATITLE.SOATA  AND  SEND  ARE  REQUIRED  ON  ALL  GRASS  RUNS. 

CONDITIONAL  CAROS  ARE  AS  FOLLOi^S— 

II  SFORNAT  MUST  BE  USED  WHEN  INPUT  DATA  IS  ON  CAROS  IN  FORMATTED  FORM 
OR  WHEN  THE  SFILE  CARO  IS  USED. 

21  SFILE  MUST  BE  USED  WHEN  INPUT  DATA  IS  IN  FORMATTED  FORM  ON  A SOURCE 
OTHER  THAN  CAROS. 

31  SFMFREE  MUST  BE  USED  WHEN  INPUT  DATA  IS  ON  CAROS  IN  NON-FORMATTED 
FORM. 

41  SFILEB  MUST  BE  USEC  WHEN  INPUT  DATA  IS  IN  NON-FORMATTEOt  BINARY# 
FORM  ON  A SOURCE  OTHER  THAN  CARDS.  ALL  OTHER  INFORMATION  CONTROL 
CAROS  ARE  OPTIONAL. 

APPENDICES  A AND  0 CONTAIN  SOME  SAMPLE  JOB  SETUPS. 
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STATISTICAL  ROUTINE 


THE  FOLLOWING  IS  AN  INDEX 

SPECIFICATION  CONTROL  CAROS.  A THOROUGH  DESCRIPTION  OF  THESE  CAROS 

AND  THE  ANALYSES  PERFORHEO  ARE  PRESENTED  IN  APPENDIX  C. 

iBSTAT  THE  NUMBER  OF  OBSERVATIONSt  MINIMUM*  MAXIMUM*  RANGE* 

MEDIAN*  MEAN*  STANDARD  DEVIATION*  AND  THE  STANDARD  ERROR 
OF  THE  MEAN  ARE  PRINTED  FOR  EACH  VARIABLE  REQUESTED. 

SNTST  NCN-PAIREO  T-TESTS  ARE  PERFORMED  BETWEEN  PAIRWISE 

combinations  of  all  VARIABLES  SPECIFIED  IN  THE  PARAMETER 

field, 

iPTST  PAIRED  T-TESTS  ARE  PERFORMED  BETWEEN  PAIRWISE  COMBINATIONS 

OF  ALL  VARIABLES  SPECIFIED. 

SRNKSM  MiLCUXCNtS  RANK  SUM  STATISTIC  IS  CALCULATED  FOR  THE  2 

VARIABLES  SPECIFIED.  THE  LARGE  SAMPLE  APPROXIMATION  OF  THE 
RANK  SUM  STATISTIC  IS  ALSO  PRINTED.  IN  ADDITION*  THE 
SIEGEL-TUKEY  TEST  STATISTIC  IS  PRINTED. 

SSNRNK  WiLCOXCNtS  SIGNED  RANK  STATISTIC  AND  ITS  LARGE  SAMPLE 

APPROXIMATION  ARE  COMPUTED  FOR  THE  2 VARIABLES  SPECIFIED. 

$ANOVA  A ONE-WAY  ANALYSIS  OF  VARIANCE  FOR  EQUAL  OR  UNEQUAL 

NUMBERS  OF  REPLICATES  IS  PERFORMED  FOR  THE  VARIABLES 
REQUESTED. 

SONMRT  DUNCAN »S  NEW  MULTIPLE-RANGE  TEST  IS  USED  TO  COMPARE  THE 

MEANS  OF  THE  VARIABLES  SPECIFIED. 

SNKMRT  NEWMAN-KEULtS  TEST  IS  USED  TO  COMPARE  THE  MEANS  OF  THE 

VARIABLES  SPECIFIED. 

SKRUML  THE  KRUSKAL-WALL IS  H STATISTIC  FOR  TESTING  THE  HYPOTHESIS 

OF  NO  DIFFERENCES  AMONG  K TREATMENTS  IS  CALCUALTEO  FOR  THE 
K VARIABLES  SPECIFIED. 

SKRUUC  THE  OUTPUT  INCLUDES  THE  KRUSKAL- WALL  I S H STATISTIC  AND 

ALSO  INCLUDES  A MULTIPLE  COMPARISON  TEST  BETWEEN  ALL 
PAIRWISE  COMBINATIONS  OF  THE  K VARIABLES  SPECIFIED. 

iCORR  PEARSON  PRODUCT  MOMENT  CORRELATION  COEFFICIENTS  ARE 

COMPUTED  FOR  ALL  PAIRWISE  CCM8INATIONS  OF  VARIABLES 
REQUESTED. 

SRNKCR  KENOALLtS  RANK  CORRELATION  COEFFICIENTS  ARE  COMPUTED  FOR 

ALL  PAIRWISE  COMBINATIONS  OF  VARIABLES  REQUESTED. 

SCHISQ  THE  CHI-SQUARE  STATISTIC  AND  CONTINGENCY  COEFFICIENT  ARE 

CALCULATED  FOR  A TWO-WAY  CONTINGENCY  TABLE. 

SLSQR  A LEAST-SQUARES  REGRESSION  ANALYSIS  IS  PERFORMED  FOR  A 

SPECIFIED  DEPENDENT  VARIABLE  AND  A SPECIFIED  INDEPENDENT 
VARIABLE. 
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(NRE6  A MULTIPLE  REGRESSION  ANALYSIS  IS  PERFORMED  FOR  A 

SPECIFIED  DEPENDENT  VARIABLE  AND  SPECIFIED  INDEPENDENT 
VARIABLES. 

$SWREG  A STEPNISE  MULTIPLE  REGRESSION  ANALYSIS  IS  PERFORMED  FOR  A 

SPECIFIED  DEPENDENT  VARIABLE  AND  SPECIFIED  INDEPENDENT 
VARIABLES. 

E 

' SP0LR6  A polynomial  REGRESSION  ANALYSIS  IS  PERFORMED  ON  THE 

DEPENDENT  AND  INDEPENDENT  VARIABLES  REQUESTED.  A 
POLYNOMIAL  OF  DEGREE  FIVE  OR  LESS  MAY  BE  FITTED. 

$NORNT  VARIOUS  TESTS  OF  NORMALITY  ARE  PERFORMED  ON  EACH  VARIABLE 

REQUESTED.  THESE  INCLUDE  A CHI-SQUARE  GOODNESS  OF  FIT 
TEST,  AND  TESTS  OF  SKEWNESS  AND  KURTOSIS. 

SPLOT  A PLOT  IS  ACCOMPLISHED  BETWEEN  2 VARIABLES. 

SPLOTN  A PLOT  OF  A SPECIFIED  DEPENDENT  VARIABLE  VERSUS  EACH 

SPECIFIED  INDEPENDENT  VARIABLE  IS  OUTPUT. 

BPLOTI  A PLOT  OF  A SPECIFIED  INDEPENDENT  VARIABLE  VERSUS  EACH 

SPECIFIED  DEPENDENT  VARIABLE  IS  OUTPUT. 


BHSTGM  A HISTCGRAM,  FREQUENCY  DISTRIBUTION,  IS  PRINTED  FOR  EACH 

VARIABLE  REQUESTED. 

BFOIST  THE  LEVEL  OF  SIGNIFICANCE  OF  THE  INPUT  F-VALUE  IS  PRINTED. 

tCOIST  THE  LEVEL  OF  SIGNIFICANCE  OF  THE  INPUT  CHI-SQUARE  VALUE  IS 

PRINTED. 

BTOIST  THE  LEVEL  OF  SIGNIFICANCE  OF  THE  INPUT  T-VALUE  IS  PRINTED. 


BRSOST  THE  LEVEL  OF  SIGNIFICANCE  OF  EITHER  THE  INPUT  WILCOXONtS 

RANK  SUM  STATISTIC  OR  THE  INPUT  WILCOXONtS  SIGNED  RANK 
STATISTIC  IS  PRINTED. 


I 
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DATA  TRANSFORMATION  CAROS 


THE  STRANS  Sf ATENENT 


THE  STRANS  CAROS  GIVE  THE  GRASS  USER  A GREAT  DEAL  OF  FLEXIBILITY 
IN  TRANSFORMING  ANO/OR  EDITING  THE  EXPERIMENTAL  DATA. 

THE  CONTROL  STATEMENT  BEGINS  WITH  A DOLLAR  SIONt  At  IN  COLUMN  1 
FOLLOMEO  BY  THE  CONTROL  WORD  TRANS  IN  COLUMNS  2 THROUGH  6.  THE 
^ OARAMETER  FIELD  IN  COLUMNS  6-79  CONTAINS  A TRANSFORMATION  STATEMENT. 
THERE  ARE  FOUR  TYPES  OF  TRANSFORMATION  STATEMENTS  — THE  VARIABLE 
STATEMENTt  THE  TRANSPOSE  STATEMENT,  THE  ASSIGNMENT  STATEMENT,  AND  THE 
CONOiriCNAL  STATEMENT. 

I.  THE  VARIABLE  STATEMENT  ALLOWS  THE  USER  TO  ASSIGN  A NAME  TO 
THE  INPUT  VARIABLES.  NORMALLY  AN  INPUT  VARIABLE  IS  REPRESENTED  BY  THE 
SUBSCRIPTED  VARIABLE  X,(I.E.  VARIABLE  1 IS  XIl),  VARIABLE  LO  IS 
XIIOH.  THE  VARIABLE  STATEMENT  ALLOWS  THE  USER  TO  CHANGE  THE  NAME,X, 
TO  ANY  NAME.  THE  NEW  NAME  IS  NOT  OUTPUT  AS  A LABEL.  AFTER  SUPPLYING 
A NEW  NAME  WITH  THE  VARIABLE  STATEMENT  EITHER  THE  NEW  NAME  OR  X MAY  BE 
USED  IN  SUBSEQUENT  TRANSFORMATION  STATEMENTS.  THIS  ALLOWS  SOME 
DIFFERENTIATION  OR  CLARIFICATION  IN  THE  SUBSEQUENT  STATEMENTS. 

EXAMPLES 

STRANS  VARIABLE  DATA 

INPUT  VARIABLE  1 COULD  BE  CALLED  OATA(l).  INPUT  VARIABLE  10  COULD  BE 
CALLED  OATA(IO). 

STRANS  VARIABLE  CURVE 

INPUT  VARIABLE  1 COULD  BE  CALLED  CURVE! II.  INPUT  VARIABLE  10  COULD  BE 
CALLED  CURVEIIO). 


2.  THE  TRANSPOSE  TRANSFORMATION  STATEMENT  ALLOWS  THE  USER  TO 
INTERCHANGE  THE  ROWS  AND  COLUMNS  OF  THE  INPUT  DATA.  THIS  EFFECTIVELY 
INTERCHANGES  OBSERVATIONS  (ROWS)  AND  VARI ABLES ( COLUMNS) . 

EXAMPLE 

STRANS  TRANSPOSE 

IF  THE  DATA  WAS  INPUT  WITH  3 VARIABLES  AND  2 CBSERVATIONS  PER  VARIABLE 
— I . E . , 

2 3 4 

5 6 7 

THE  ABOVE  STRANS  STATEMENT  WOULD  RESULT  IN  DATA  WITH  2 VARIABLES  AND  3 
OBSERVATIONS  PER  VARIABLE  - I.E., 

2 5 

3 6 

4 7 


3.  THE  ASSIGNMENT  STATEMENT  ALLOWS  THE  USER  TO  DEFINE  NEW 
VARIABLES  OR  TO  REDEFINE  OLD  INPUT  VARIABLES.  THE  FORM  OF  THE 
STATEMENT  MAY  BE  ANY  LOGICALLY  CONSISTENT  COLLECTION  OF  OPERANDS, 
(VARIABLES  OR  CONSTANTS),  OPERATIONS,  AND  FUNCTIONS  EXPRESSING  A 


1.3 


CRASS 

GRASS  CCNTROL  CAROS 


14 


RELATION  BETWEEN  A DEPENDENT  VARIABLE  AND  ONE  OR  MORE  INDEPENDENT 
VARIABLES.  THE  OPERATIONS  WHICH  ARE  ALLOWED  ARE? 

♦ ADDITION 
SUBTRACITON 

♦ NULTIPL ICATION 

/ DIVISION 

f EXPONENTIATION 

UNLESS  OTHERWISE  INDICATED  BY  PARENTHESES  THE  OPERATIONS  ARE  PERFORMED 
IN  THE  FOLLOWING  ORDER? 

1)  EXPONENTIATION  IS  PERFORMED  FIRST. 

2)  MULTIPLICATION  AND  DIVISION  ARE  PERFORMED  NEXT. 

31  ADDITION  AND  SUBTRACTION  ARE  PERFORMED  LAST. 

OPERATIONS  OF  THE  SAME  PRIORITY  (I.E.  MULTIPLICATION  AND  DIVISION)  ARE 
PERFORMED  IN  ORDER  OF  OCCURENCE  FROM  LEFT  TO  RIGHT. 

EXAMPLES 


GRASS  ALGEBRAIC 

EXPRESSION  EQUIVALENT 


A*B-C 

A*<B-CI 

A*B/C*0 

A*B/(C«DI 

AtB*C 

AfIBFC) 

(A) (B)-C 
A(B-CI 

(AMB)ID)/(C) 

(A)(B)/ICnO) 

lA  RAISED  TO  THE  POWER  b)*C 
A RAISED  TO  THE  POWER  (B*C) 

THE  FOLLOWING  FUNCTIONS 

ARE  AVAILABLE  IN  THE  TRANS  COMPILER 

FUNCTION 

FUNCTION 

NAME 

PERFORMED 

SIN 

SINE 

COS 

COSINE 

TAN 

TANGENT 

AS  IN 

ARC  SINE 

ACOS 

ARC  COSINE 

AT  AN 

ARC  TANGENT 

LN 

LOG  BASE  E 

LOG 

LOG  BASE  10 

EXP 

EXPONENTIATION  BASE  E 

ABS 

ABSOLUTE  VALUE 

SORT 

SQUARE  ROOT 

PROBIT 

PROBIT  OF  A NUMBER  BETWEEN  0 AND 

APR06IT 

INVERSE  OF  PROBIT 

LINE 

WHEN  USED  AFTER  $LSQR  OR  APOLRG 

RANF 

LOGICAL 

COMPUTES  NEW  DEPENDENT  VARIABLE 
VALUES  USING  THE  LEAST  SQUARE 
ESTIMATES 

LOGICAL  VALUE  1 OR  0 
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EXAMPLE 

ATMANS  XI2)«  3*LNIl*SlNIXf llll 

THE  ABOVE  ATRANS  STATEMENT  WOULD  CAUSE  A NEW  SECOND  VARIABLE  TO  BE 
CREATED  EQUAL  TO  3 TIMES  THE  NATURAL  LOG  OF  THE  QUANTITY  1 PLUS  THE 
SINE  OF  VARIABLE  1. 

IN  CREATING  NEW  VARIABLES  THE  FOLLOWING  SHOULD  BE  NOTED? 

Al  WHEN  ANY  OF  THE  VALUES  OF  INDEPENDENT  VARIABLES  ARE  MISSING  THE 
CORRESPONDING  VALUES  OF  THE  DEPENDENT  VARIABLE  WILL  BE  MISSING. 

Bl  IF  THE  DEPENDENT  VARIABLE  IN  AN  ASSIGNMENT  STATEMENT  IS  AN  ALREADY 
EXISTING  VARIABLE*  THE  OLD  VALUES  WILL  BE  REWRITTEN  AS  DICTATED  BY  THE 
ASSIGMENT  STATEMENT. 

4.  THE  CONDITIONAL  STATEMENTS  TEST  A GIVEN  COMPARISON  AND  TAKE 
ONE  OF  TWO  BRANCHES*  (ASSIGNMENT  STATEMENTS)*  DEPENDING  ON  THE  RESULT 
OF  THE  COMPARISON.  THE  GENERAL  FORM  OF  THE  CONDITIONAL  STATEMENT  IS 
ATMANS  IF  A THEN  B 
ATMANS  ELSE  C 

WHERE  -A-  IS  A COMPARISON  STATEMENT*  AND  -B-  AND  -C-  ARE  ASSIGNMENT 
STATEMENTS.  A COMPARISON  STATEMENT  IS  COMPOSED  OF  TWO  EXPRESSIONS 
SEPARATED  BY  A SIGN  OF  COMPARISON*  A COMPARATOR.  THE  COMPARATORS 
AVAILABLE  TO  THE  USER  ARE? 

> EQUAL 

# NOT  EQUAL 

< LESS  THAN 

S LESS  THAN  OR  EQUAL 

> GREATER  THAN 

2 GREATER  THAN  OR  EQUAL 

IF  THE  COMPARISON  -A-  PROVES  TRUE*  THEN  THE  STATEMENT  DEFINED  AS  THE 
OBJECT  OF  THE  THEN  CLAUSE  >B-  IS  EXECUTED.  OTHERWISE  THE  OBJECT  OF 
THE  ELSE  CLAUSE  -C-  IS  EXECUTED.  THE  ELSE  CLAUSE  IS  OPTIONAL.  IF  IT 
IS  DESIRED  TO  LEAVE  UNCHANGED  THE  VALUE  OF  THE  DEPENDENT  VARIABLE  WHEN 
-A-  PROVES  FALSE*  THE  ELSE  CLAUSE  NEED  NOT  BE  INCLUDED. 

THE  OBJECTS  OF  THE  THEN  AND  ELSE  CLAUSES  MAY  BE  A 00  STATEMENT* 

IN  WHICH  CASE  ALL  OF  THE  ASSIGNMENT  STATEMENTS  FOLLOWING  THE  DO 
STATEMENT  WILL  BE  EXECUTED  UNTIL  AN  END  STATEMENT  TERMINATES  THE  LIST. 
DO  STATEMENTS  MAY  ALSO  BE  NESTED.  THIS  FACILITY  ALLOWS  THE  EXECUTION 
OF  SEVERAL  ASSIGNMENT  STATEMENTS  UPON  THE  RESULT  OF  A SINGLE 
COMPARISON  STATEMENT. 

IN  ADDITION*  THE  OBJECTS  OF  THE  THEN  OR  ELSE  STATEMENT  NAY  BE 
ANOTHER  IF  STATEMENT  ALLOWING  FURTHER  FLEXIBILITY. 
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EXAMPLES 

1.  STRANS  IF  Xf2l<LOC(Xmi  THEN  Xm«5«Xlll 
STRANS  ELSE  X(3l«l«4*ATAN(Xf 31 1 

WHENEVER  A VALUE  OF  X(2)  IS  LESS  THAN  LCG  BASE  10  OF  Xdl  THE 
THIRD  VARIABLE.  X(3).  WOULD  BE  ASSIGNED  A VALUE  EQUAL  TO  5 TIMES  THE 
VALUE  OF  xm.  OTHERWISE  THE  THIRD  VARIABLE  WOULD  BE  ASSIGNED  A VALUE 
EQUAL  TO  1 MINUS  4 TIMES  THE  ARCTANGENT  OF  THE  THIRD  VARIABLE.  THIS 
WOULD  NATURALLY  RESULT  IN  REPLACEMENT  OF  VALUES  PREVIOUSLY  ASSIGNED  TO 
THE  THIRD  VARIABLE. 

2.  STRANS  IF  X(2I>0  THEN  IF  L0GICALCXI2II«1  THEN  XI2I«1 

THIS  STATEMENT  WOULD  DETECT  MISSING  VALUES  IN  THE  SECOND 
VARIABLE.  XI2I.  AT^D  ASSIGN  A VALUE  OF  1 WHERE  THE  MISSING  VALUES 
QCCUREO. 


STRANS 

IF  XI3I>0  THEN  DO 

STRANS 

Xlll»Xf3l*5-2 

STRANS 

X(2l«SQRTIXt4ll 

STRANS 

END 

STRANS 

ELSE  DO 

STRANS 

XI1I«0 

STRANS 

Xf2l«0 

STRANS 

Xf3)«l 

STRANS 

END 

WHENEVER  A VALUE  OF  THE  THIRD  VARIABLE.  X(3)»  IS  GREATER  THAN  0. 
THE  FIRST  VARIABLE.  X(D.  WILL  BE  ASSIGNED  A VALUE  EQUAL  TO 
(X(3M5)l-2  AND  THE  SECCND  VARIABLE  WILL  BE  ASSIGNED  A NEW  VALUE  EQUAL 
TO  THE  SQUARE  ROOT  OF  THE  FOURTH  VARIABLE.  CTHERWISE.  XII)  AND  XI2I 
WILL  BE  ASSIGNED  A VALUE  OF  0 AND  X(3)  WILL  BE  REPLACED  BY  THE  VALUE 
1. 

•NOTE  the  FOLLOWING  REGARDING  THE  USE  OF  $TRANS  CAROS. 

II  FOLLOWING  ANY  TRANSFORMATION  THE  NUMBER  OF  VARIABLES  AND 
□BSERVATIJNS  IS  AUTOMATICALLY  UPDATED.  THE  SDATA  CARD  SHOULD  REFLECT 
THE  NUMBER  OF  VARIABLES  AND  OBSERVATIONS  INPUT  AND  NOT  THE  NUMBERS 
FOLLOWING  TRANSFORMATIONS. 

21  GROUPS  OF  STRANS  CAROS  WHICH  APPEAR  AFTER  THE  INPUT  DATA  HAS  BEEN 
ENTERED  MUST  BE  FOLLOWED  BY  A SEXECU  CARO  OR  ANY  STATISTICAL  ROUTINE 
SPECIFICATION  CARD.  FDk  EXAMPLE.  IF  THE  CAROS  IN  EXAMPLE  3 WERE 
INSERTED  AFTER  THE  INPUT  DATA  HAD  BEEN  ENTERED.  A SEXECU  CARO  OR  A 
STATISTICAL  ROUTINE  SPECIFICATION  CARO  MUST  FOLLOW  THE  STRANS  END 
STATMENT  before  ADDITIONAL  STRANS  STATEMENTS  COULD  BE  INSERTED. 

31  NO  MORE  THAN  lO.DDO  DATA  ELEMENTS  MAY  BE  HANDLED  WITH  ONE  SET  OF 
STRANS  CARDS  lI.E..  ONE  ASS  IGNEMNT  OR  TRANSPOSE  STATEMENT  OR  A 
CGNOITICNAL  STATEMENT  WITH  ITS  ASSIGNMENT  STATEMENTS!. 


l.<* 
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1.4  SAMPLE  PROBLEMS 

FIVE  SAMPLE  PROBLEMS  ARE  PRESENTED  TO  SHOW  ASSEMBLED  CARO  DECKS 
AND  OUTPUT  FOR  SOME  TYPICAL  GRASS  RUNS.  IN  THE  FIRST  EXAMPLEt  THE  SIX 
STEPS  USED  IN  PREPARING  THE  EXPERIMENTAL  DATA  WILL  BE  DETAILED.  IN 
ALL  FXARPLESf  THE  DATA  ARE  FICTITIOUS. 

EXAMPLE  1 

AN  INVESTIGATOR  HAS  4 VARIABLES  WITH  6 OBSERVATIONS  OF  EACH 
variable.  HE  DESIRES  TU  COMPUTE  MEANS  AND  STANDARD  DEVIATIONS  FOR 
EACH  VARIABLE,  CORRELATIONS  BETWEEN  TIME  ON  TREADMILL,  AGE,  BODY 
TEMPERATURE  AND  PULSE,  AND  A REGRESSION  LINE  RELATING  TIME  OF  WALK  TO 
THE  OTHER  3 VARIABLES.  THE  RAW  DATA  ARE  SHGuN  IN  TABLE  1. 


TABLE  1 


**»0******************************************** 


SUBJECT 

TREADMILL 

TIME  OF 

BODY 

PULSE* 

NUMBER 

SPEED(MPH) 

AGE 

WALK(MIN) 

TEMP. 

(BPM)* 

I 

5 

^0 

10. 0 

98.9 

123  ♦ 

2 

5 

IB 

15.8 

9B.7 

147  * 

3 

5 

35 

8.8 

99.  1 

109  ♦ 

4 

5 

24 

8.0 

98.6 

118  * 

•> 

5 

41 

12.5 

99.4 

131  ♦ 

6 

5 

43 

12.8 

99.7 

135  ♦ 

****************  *************************** 


THE  STEPS  FuR  PUTTING  THIS  DATA  IN  A SUITABLE  FORM  FOR  GRASS  USE 
ARE  AS  FOLLOWS. 

1.  ASSIGN  SEQUENTIAL  NUHBERICAL  IDENTIFIERS  TO  EACH  VARIABLE. 
VARIABLE  NUMERIC  IDENTIFIER 


AGE  I 
TIME  OF  WALK  2 
BODY  TEMP.  3 
PULSE  4 


2.  DETERMINE  THE  MAXIMUM  FIELD  WIDTH  (NUMBER  OF  CHARACTERS)  FOR 
EACH  VARIABLE. 

VARIABLE  MAXIMUM  FIELD  WIDTH 


AGE  2 

TIME  OF  WALK  4 (3) 

BODY  TEMP.  4 (3) 

PULSE  3 


NUMBERS  IN  PARENTHESES  ARE  THE  MAXIMUM  FIELD  WIDTH  IF  THE  DECIMAL  IS 
NOT  PUNCHED. 


3t4*5.  THE  DATA  ARE  ENTERED  ON  80  COLUMN  DATA  SHEETS,  (FIGURE 
II,  FOR  KEYPUNCHING.  IN  THIS  EXAMPLE,  THE  DECIMAL  POINT  IS  TO  BE 
PUNCHED  FOR  TIME  OF  wALK  AND  WILL  BE  IMPLIED  FOR  BODY  TEMPERATURE. 


GRASS 

SAMPLE  PROBLEMS 


6«  THE  DATA  COULD  BE  DESCRIBED  BY  THE  FORMAT  STATEMENTt 

$F0RNATIF2.0»F5.0*F4.1fF4.0l. 

THE  ASSEMBLED  CARO  DECK*  INCLUDING  SYSTEM  CONTROL  CARDS* 
INFORMATION  CONTROL  CARDS*  DATA  CAROS  AND  STATISTICAL  ROUTINE 
SPECIFICATION  CAROS*  IS  SHOWN  IN  FIGURE  2.  NOTE  THAT  THE  DATA  CARDS 
IMMEDIATELY  FOLLOW  THE  SFORNAT  CARD. 

COMPUTER  OUTPUT  FOR  THIS  RUN  IS  GIVEN  IN  FIGURES  3 THROUGH  15. 
FIGURE  4 IS  A LISTING  OF  THE  RAw  DATA.  FIGURE  5 IS  THE  OUTPUT  FROM 
THE  BASIC  STATISTICS  ROUTINE.  FIGURE  6 SHOWS  THE  PEARSON  CORRELATION 
COEFFICIENTS  FOR  ALL  VARIABLES.  FIGURES  7 THROUGH  12  GIVE  THE  OUTPUT 
FROM  THE  RANK  CORRELATION  ROUTINE.  FIGURES  13  THROUGH  15  ARE  THE 
RESULTS  FROM  MULTIPLE  REGRESSION. 


EXAMPLE  2 

AN  INVESTIGATOR  HAS  COLLECTED  DATA  ON  8 SUBJECTS.  THE  VARIABLES 
MEASURED  ARE  HEIGHT*  WEIGHT*  AGE.  AND  HEART  RATE.  THE  INVESTIGATOR  IS 
INTERESTED  IN  THE  RELATIONSHIPS  BETWEEN  THESE  VARIABLES.  HE  DECIDES 
TO  CALCULATE  BASIC  STATISTICS  FOR  ALL  VARIABLES  AND  PEARSON 
CORRELATIONS  BETWEEN  ALL  PAIRS  CF  VARIABLES.  HE  WISHES  TO  COMPUTE  THE 
REGRESSION  EQUATION  BETWEEN  HEART  RATE  AND  WEIGHT*  A PLOT  BETWEEN 
THESE  2 VARIABLES,  A FOURTH  DEGREE  POLYNOMIAL  REGRESSION  EQUATION 
BETWEEN  HEART  RATE  AND  HEIGHT,  AND  THE  BEST  REGRESSION  EQUATION 
RELATING  THE  DEPENDENT  VARIABLE,  WEIGHT,  WITH  THE  INDEPENDENT 
VARIABLES*  HEIGHT  AND  AGE.  A PLOT  BETWEEN  THESE  3 VARIABLES  IS  ALSO 
DESIRED.  THE  RAW  DATA  ARE  SHOWN  IN  TABLE  2. 


TABLE  2 


SUBJECT 

HEIGHTIINl 

ME IGHT(LBS) 

AGE 

HEART  RATE* 



— 

♦ 

1 

68 

150 

22 

70  * 

2 

69 

160 

23 

78  * 

3 

70 

169 

25 

79  * 

4 

71 

167 

27 

82  * 

5 

72 

170 

29 

83  * 

6 

73 

175 

24 

87  ♦ 

7 

74 

180 

23 

89  ♦ 

8 

75 

180 

21 

72  ♦ 

A LIST  OF  THE  ASSEMBLED  CARD  DECK  IS  SHOWN  IN  FIGURE  16.  FIGURES 
17  THROUGH  29  SHOW  THE  COMPUTER  OUTPUT,  THIS  EXAMPLE  DEMONSTRATES  HOW 
tCNT  AND  STITLE  CAROS  MAY  BE  USED  FOR  LABELING. 


EXAMPLE  3 

24  TEST  ANIMALS  HERE  RANDOMLY  ASSIGNED  IN  GROUPS  OF  8 TO  3 
TREATMENTS*  DIETS.  THE  ACTIVITY  OF  EACH  ANIMAL  MAS  MEASURED.  THE 
HYPOTHESIS  OF  NO  DIFFERENCES  BETWEEN  DIETS  COULD  BE  TESTED  USING 
EITHER  A ONE-WAY  ANALYSIS  OF  VARIANCE  OR  THE  KRUSKAL-MALLI S ROUTINE. 
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A PARAMETRIC  OR  A NON-PARAHETR IC  MULTIPLE  COMPARISON  PROCEDURE  COULD 
Be  USED  TO  DECIDE  WHICH  TREATMENTS  DIFFER.  (AN  INVESTIGATOR  CONFRONTED 
WITH  A SIMILAR  PROBLEM  SHOULD  CHOOSE  EITHER  THE  PARAMETRIC  OR 
NON-PARAMETRIC  APPROACH  TO  ANALYZING  THE  DATA.  BOTH  METHODS  ARE 
DEMONSTRATED  IN  THIS  EXAMPLE  ONLY  TO  SHOW  THE  DIFFERENT  OUTPUTS. I THE 
RAW  DATA  ARE  PRESENTED  IN  TABLE  3. 


«**«•*«« 

TABLE 

3 

t*«4*4>«** 

♦DIET  1 

DIET 

2 

DIET  3* 

«c 


1^  12  19 


15 

16 

17 

14 

12 

19 

17 

13 

17 

le. 

17 

18 

IB 

14 

19 

19 

13 

16 

15 

16 

17 

*«******«****4*«4««4«***** 

A LIST  OF  THE  ASSEMBLED  CARO  DECK  IS  SHOWN  IN  FIGURE  30.  FIGURES 
31  THROUGH  37  SHOW  THE  OUTPUT  FOR  THIS  EXAMPLE. 


EXAMPLE  4 

AN  INVESTIGATOR  PERFORMED  TWO  EXPERIMENTS.  IN  THE  FIRST  STUDY#  7 
test  ANIMALS  HERE  FED  A CONTROL  DIET  AND  THEIR  ACTIVITIES  WERE 
MEASURED.  THE  SAME  7 ANIMALS  WERE  THEN  FED  AN  ENRICHED  DIET  AND  THEIR 
ACTIVITIES  HERE  AGAIN  MEASURED.  IN  THE  SECOND  STUDY  20  TEST  ANIMALS 
MERE  RANDOMLY  ASSIGNED  IN  GROUPS  OF  10  TO  EITHER  THE  CONTROL  DIET  OR 
ENRICHED  DIET,  FOR  REASONS  UNRELATED  TO  EITHER  TREATMENT,  ONE  ANIMAL 
IN  THE  CONTROL  GROUP  AND  TWO  ANIMALS  IN  THE  TEST  GROUP  WERE  NOT 
MEASURED.  (THE  INVESTIGATORt S CHILD  OPENED  3 CAGES  AND  THE  ANIMALS 
ESCAPED.  THEY  WERE  LATER  RETRIEVED  FROM  SOME  GARBAGE  CANS  WHERE  THEY 
HAD  BEEN  FEASTING.  NATURALLY,  THEY  HAD  TO  BE  DI SQUALT I F lED . I THE 
INVESTIGATOR  HAS  INTERESTED  IN  THE  EFFECTS  OF  THE  ENRICHED  DIETS,  IF 
ANY,  ON  ACTIVITY.  HE  MIGHT  HAVE  USED  EITHER  A PAIRED  T-TEST  QR 
WILCOXONtS  SIGNED  RANKS  TEST  TO  EVALUATE  THE  DATA  FROM  THE  FIRST 
STUDY,  AND  HE  MIGHT  HAVE  USED  EITHER  A NON-PAIREO  T-TEST  QR  WILCOXONtS 
RANK  SUM  TEST  TO  ANALYSE  THE  DATA  FROM  THE  SECOND  STUDY.  THE  OUTPUT 
INCLUDES  ALL  4 TESTS.  THE  RAW  DATA  ARE  PRESENTED  IN  TABLE  4. 
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TABLE  4 

1 «»**«i«*«****p**««««*«p«p««***p««*p«***«**«A«*««««p«p««« 

1 * STUDY  1 

** 

STUDY 

2 * 

1 ♦CONTROL 

ENRICHED 

*♦ 

CONTROL 

ENRICHED* 

' ♦ DIET 

V * 

DIET 

*« 

DIET 

DIET  * 

'f  ♦ 15 

13 

♦* 

13 

15  ♦ 

1 ♦ 17 

19 

«♦ 

15 

19  ♦ 

S ♦ 10 

15 

** 

14 

13  ♦ 

♦ 19 

17 

** 

10 

20  * 

* 12 

16 

*♦ 

12 

17  ♦ 

♦ 13 

15 

** 

17 

16  ♦ 

♦ 16 

18 

«♦ 

17 

15  ♦ 

* 

** 

18 

18  ♦ 

♦ 

** 

15 

♦ 

1 A LISTING  OF 

THE  assembled 

CARO  DECK 

IS  GIVEN  IN 

FIGURE  38 

1 FIGURES  39  THROUGH 

47  SHOW  THE 

OUTPUT  FOR 

THIS  EXAMPLE 

• 

EXAMPLE  5 

EXAMPLE  5 SHOWS  SOME  USES  OF  ATRANS  AND  AAOO.  FIGURE  48  SHOWS 
THE  ASSEMBLED  CARD  DECK.  FIGURES  49  THROUGH  57  SHOW  THE  OUTPUT. 
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ALL  SYSTEM  CAROS  IN  THIS  AND  FOLLOWING  SECTIONS  ARE  FOR  THE 
C0C7600  BKY  OPERATING  SYSTEM.  IN  THE  FOLLOWING  EXAMPLES#  THE  GRASS 
CONTROL  CAROS*  ITHCSE  BEGINNING  WITH  A DOLLAR  SIGN)*  INDICATE  THAT 
THERE  ARE  10  INPUT  VARIABLES  WITH  15  OBSERVATIONS*  (CASES).  THE  INPUT 
DATA  IS  NOT  TO  BE  PRINTED  AND  BASIC  STATISTICS  ARE  TO  BE  CALCULATED 
FOR  ALL  VARIABLES. 

THE  FIRST  CARO  OF  EVERY  DECK  IS  THE  JOB  CARO.  THE  GENERAL 
ABBREVIATED  FORMAT  OF  THE  JOB  CARO  IS 

JOBNANE.ACCNUMtYOUR  («AHE 

WHERE 

JOBNAME  - 1-7  ALPHANUMERIC  CHARACTERS  BEGINNING  WITH  A LETTER 
IN  COLUMN  1 

ACCNUM  - YOUR  6 DIGIT  ACCOUNT  NUMBER 
YOUR  NAME  - ANYTHING  IN  THE  REMAINING  COLUMNS. 

LAIR  USERS  SHOULD  CONSULT  MEMORANDUM  FOR  ALL  UT200  USERS  DATED  AUGUST 
16*  1914  FOR  SPECIFIC  CHARACTERS  TO  BE  USED  FOR  -JOBNAME>*  >ACCNUN** 
AND  -YOUR  NAME-. 


EXAMPLES 

I.  INPUT  DATA  ON  CAROS 

JOB  CARO 

FETCHPSIGRASZ.GRAZZ.GRASS) 

GRAZZ. 

(7/8/9  MULTIPUNCH  CARO) 

$riTLE  EXAMPLE  WITH  INPUT  DATA  CN  CARDS 

(DATA  10,15*0 

$F0RMAT(10F5.0) 

DATA  CARDS 
S3STAT  ALL 
iEND 

(6/7/B/9  MULTIPUNCH  CARD) 


2.  INPUT  DATA  IN  FORMATTED  OR  BINARY  FORM  ON  MAGNETIC  TAPE. 

JOB  CARO 

FE TCHPS ( GRAS Z, GRAZZ, GRASS) 

STAGE, MY  TAPE, TAPENO. 

GRAZZ. 

(7/B/9  MULTIPUNCH  CARD) 

atitle  example  with  input  data  on  MYTAPE 

(DATA  10,15,0 

AFILE  MYTAPE  OR  AFILEB  MYTAPE 

AFORMAT( 10F5.0)  NO  FORMAT  STATEMENT 

DATA  CN  MYTAPE  MUST  BE  IN  BINARY  FORM 

A3STAT  ALL 
AcNO 

(6/7/B/9  MULTIPUNCH  CARO) 
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3.  INPUT  DATA  IN  FORMATTED  OR  BINARY  FORM  ON  A PERMANENT  FILE  DISK. 

JJd  CArtO 

FSTCHPSCGRASZ,GRAZZ,r.RASS) 

ATTACH(MYUATA,MYFILEi 

GRAZZ. 

C 7/8/9  MULTIPUNCH  CARD) 

sriTLE  EXAMPLE  hITH  INPUT  RATA  ON  DISK 

tOATA  lO.lSfO 

tElLc  MYOATA  OR  $FILEfl  MYOATA 

SFORUTI  10F5.0)  NO  FORMAT  STATEMENT 

DATA  ON  MYOATA  MLST  BE  IN  BINARY  FORM 

AbSTAT  ALL 
$END 

(6/T/8/9  MULTIPUNCH  CARD) 


4.  INPUT  DATA  IS  ON  CAROS  IN  FORMATTED  FORM.  TMO  NEM  VARIABLES 

ARE  CREATED  USING  STRANS  CAROS.  THESE  NEM  VARIABLES  ARE  MITTEN 
IN  FORMATTED  OR  BINARY  FORM  TO  A MAGNETIC  TAPE. 

JOB  CARD 

F=TCHPS(GRASZ,GRAZZ,GKASS) 

GRAZZ. 

STAGctNEMVARtTAPENOy W. 

(7/8/9  MULTIPUNCH  CARD) 

ATITLE  MRITING  NEW  VARIABLES  TC  NEMVAR. 

SDATA  I0«15»0 
tTRANS  X(ll)=2*  X( 1) 

STRANS  X(12)=20-X( 11) 

SFORMATI 10F5.C) 

Data  CAROS 
SBSTAT  ALL 
SMRITE  11.12 

SFILE  NEWVAR  OR  SFILEB  NEMVAR 

$FGRMAT(2F5.0)  no  FORMAT  CARO 

SEND 

(6/7/B/9  MULTIPUNCH  CARO) 


IF  NONE  OF  THESE  EXAMPLES  SUITS  YOUR  PROBLEM.  OR  MACHINE.  OR  IF 
YOU  ARE  UNSURE  ABOUT  THE  APPROPRIATE  SYSTEM  CAROS  TO  USE.  SPEAK  TO  A 
LOCAL  CONSULTANT. 
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I 8 

ATITLE  A TITLE 


PARANETEK  FIELD  (COLUMNS  8-79) 

ANY  ALPHANUMERIC  TITLE  IN  COLUMNS  8-79 


OUTPUT 

THE  title  is  printed  ON  THE  TOP  OF  ALL  PAGES  OF  OUTPUT  UNTIL 
ANOTHER  STITLE  CARD  IS  USED. 


USAGE 

1,  THE  TITLE  MUST  BE  CONTAINED  ON  ONE  8T1TLE  CARO. 

2.  COLUMN  80  MUST  NOT  BE  PUNCHED  ON  THIS  CARO. 


EXAMPLE 

1 8 

STITLE  ANALYSES  OF  MY  DATA 


B 
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1 


i 8 

fOATA  I1»I2»13 


DESCRIBING  INPUT  DATA 


PARAMETER  FIELD  ICOLUMNS  8-791 

11  NUMBER  OF  VARIABLES 

12  NUMBER  OF  CASES 

13  O-OC  NCT  PRINT  INPUT  DATA 
1-PRINT  INPUT  DATA 


OUTPUT 

1.  INPUT  DATA  IS  PRINTED  IF  13*1. 


USAGE 

1.  MAXIMUM  OF  50  INPUT  VARIABLES  II1S50) 

2.  NO  LIMIT  ON  NUMBER  CF  OBSERVATIONS 


EXAMPLES 


CARDS 

1 

SOATA 

8 

6«30«l 

COMMENTS 

6 VARIABLES, 

30  OBSERVATIONS 

PRINT  INPUT 

DATA 

$OATA 

7,10,0 

7 VARIABLES, 

10  OBSERVATIONS 

DC  NOT  PRINT 

INPUT  DATA 

B 
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WRITIN6  SPECIFIED  VARIABLES 


I 8 

tWRITE  IlfI2f.tI50 


PARAMETER  FIELD  t COLUMNS  8-79) 

11 

12 


..  VARIABLES  TO  BE  WRITTEN 


150 


RESULT 

1.  IF  SWRITE  IS  IMMEDIATELY  FOLLOWED  BY  A AFORMAT  CARDt  ALL 
VALUES  (CASES)  OF  THE  VARIABLES  SPECIFIED  IN  THE  PARAMETER 
FIELD  WILL  BE  WRITTEN  ACCORDING  TO  THE  FORMAT  SPECIFIED 

ON  THE  AFORNAT  CARO  TO  THE  LOCAL  FILE  BCOOUT. 

2.  IF  AWRITE  IS  FOLLOWED  BY  A AFILE  CARO  AND  A AFORMAT  CARDt 
ALL  CASES  OF  THE  VARIABLES  SPECIFIED  WILL  BE  WRITTEN  TO 
THE  FILE  NAMED  ON  THE  AFILE  CARO  IN  THE  FORMAT  SPECIFIED 
ON  THE  AFORMAT  CARO. 

3.  IF  AWRITE  IS  FOLLOWED  BY  A AFILEB  CARO,  ALL  CASES  OF  THE 
THE  VARIABLES  SPECIFIED  WILL  BE  WRITTEN  IN  BINARYt 
(NON-FORMATTEO) t FORM  TO  THE  FILE  NAMED  ON  THE  AFILEB  CARO. 
NO  AFORMAT  CARD  IS  USED  WITH  AFILEB.  IF  ONE  IS  USED  IT  WILL 
BE  IGNORED. 


USAGE 

1.  A MAXIMUM  OF  50  VARIABLES  MAY  BE  WRITTEN. 

2.  NO  LIMIT  ON  NUMBER  OF  CASES  WRITTEN. 


EXAMPLES 
CAROS 
1 8 
AWRITE  It3t5 
AFORMAT (3F5.0) 


COMMENTS 

VARIABLES  lt3t  AND  5 WILL  BE 
WRITTEN  TO  BCOOUT  IN  3 FIELDS  OF  F5.0. 


AWRITE  It3t5  VARIABLES  lt3  AND  5 WILL  BE 

AFILE  MYOATA  WRITTEN  TO  HYOATA  IN  3F5.0  FORMAT. 

AF0RMATI3F5.0) 


AWRITE  It3t5  VARIABLES  it3  AND  5 WILL  BE  WRITTEN 

AFILEB  BINDATA  TO  BINOATA  IN  BINARY  (UNFORMATTED)  FORM. 
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1 

COOED  FILES  | 

1 a 

SFILE  LFNANE 


PARAMETER  FIELD  (COLUMNS  8-14) 

1.  THE  LOCAL  FILE  NAME  OF  THE  COOEO*  FORMATTED.  FILE  THAT 
CONTAINS  THE  INPUT  DATA.  OR 

2.  THE  LOCAL  FILE  NAME  OF  THE  FILE  THE  DATA  IS  TO  BE 
WRITTEN  TO  IN  FORMATTED  FORM. 


USAGE  ^ 

1.  WHEN  READING  FORMATTED  DATA  FROM  A SOURCE  OTHER  THAN  CAROS.  I 

THE  SFILE  CARO  MLST  COME  BETWEEN  THE  SOATA  AND  AFORNAT  CAROS. 

2.  WHEN  WRITING  FORMATTED  DATA  TO  AN  ALTERNATE  OUTPUT  DEVICE. 

THE  AFILE  CARO  MUST  COME  BETWEEN  THE  AWRITE  AND  AFORHAT  CARDS.  ! 

IF  NO  LOCAL  FILE  NAME  IS  PUNCHED  IN  THE  PARAMETER  FIELD  OF  i 

THE  AFILE  CARO,  DATA  SPECIFIED  WILL  BE  WRITTEN  TO  BCDOUT. 

RESULT 

1.  INPUT  DATA  WILL  BE  READ  FROM  THE  FILE  NAMED,  OR 

2.  SPECIFIED  DATA  WILL  BE  WRITTEN  TO  THE  FILE  NAMED. 


EXAMPLES 

1.  FOR  READING  DATA  FROM  AN  ALTERNATE  INPUT  SOURCE  SEE 
APPENDIX  A,  EXAMPLES  2 AND  3. 

2.  FOR  WRITING  DATA  TO  AN  ALTERNATE  OUTPUT  DEVICE  SEE 
APPENDIX  A,  example  4. 


■I 


1 


B 
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BINARY  #ILES 


I 3 

afileb  lfnane 


PARAMETER  FIELD  ICCLUMNS  8-14)  ct.  c tuat 

1,  THE  LOCAL  FILE  NAME  OF  THE  BINARYt  (ONFORHATTEOI t FILE  THAT 

CONTAINS  THE  INPUT  DATAt  OR 

2.  THE  local  file  NAME  OF  THE  BINARY  FILE  THAT  SPECIFIED 
VARIABLES  ARE  TO  BE  WRITTEN  TC  IN  BINARY  FORM. 


USAGE 

1. 


■3. 


WHEN  READING  DATA  FROM  BINARY  FILEt  THE  AFILEB  CABO 
FOLLOWS  THE  AOATA  CARO.  _ ^ 

WHEN  WRITING  A BINARY  FILE#  THE  AFILEB  CARO  FOLLOWS  THE 

AHRITE  CARO.  , ^ 

NO  AFORHAT  CARD  IS  USED  WITH  THE  AFILEB  CARO. 


"^^'^ATA  MAY  BE  READ  FRCM  CR  WRITTEN  TO  A BINARY  FILE. 


EXAMPLE 

1. 

2. 


TO  READ  DATA  FRCM  A BINARY  FILE#  SEE  APPENDIX  A# 
EXAMPLES  2 AND  3. 

TO  WRITE  A BINARY  FILE,  SEE  APPENDIX  A,  EXAMPLE  4. 
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FORMAT  STATEMENT 

1 8 

AFOlUIATIFORTRAM  FORMAT  STATEMENT I 


PARAMETER  FIELD  I COLUMNS  8-79) 

ANY  FORTRAN  FORMAT  STATEMENT  USING  Ft  Et  AND/OR  X SPECIFICATIONS 


RESULT 

DESCRIBES  THE  FIELDS  OF  INPUT  DATA. 

I 

i 

\ USA6E 

1.  A FORMAT  STATEMENT  NAY  BE  CONTINUED  ON  FOLLOWING  CAROS 
> BUT  THE  TOTAL  LENGTH  CANNOT  EXCEED  250  CHARACTERS. 

I 2 THE  SFORMAT  CARO  MUST  FOLLOW  THE  BOATA  CARO  IF  INPUT 

I DATA  IS  ON  CAROS  IN  FORMATTED  FORM. 

[ 3 THE  BFORMAT  CARO  FOLLOWS  THE  BFILE  CARO  IF  INPUT  DATA 

[ IS  ON  AN  ALTERNATE  INPUT  SOURCE  IN  FORMATTED  FORM  OR 

IF  VARIABLES  ARE  TO  BE  WRITTEN  IN  FORMATTED  FORM  TO  AN 
ALTERNATE  OUTPUT  CEVICE. 

4.  SFORMAT  IS  NOT  USED  WITH  THE  SFILEB  CARO. 


EXANFLCS 
CAROS 
1 6 

SFORMAT I 2F3. 2. iX»F4.2) 
SFORMAT (F4.1tF3.0tF4.2tF2. 01 


B 


DESCRIPTION  OF  INFORMATION  CONTROL  CAROS 


29 


FORMFREE  INPUT  DATA 


1 8 
8FMFREE 


PARNETER  FIELD  (COLUMNS  8-791 
NO  ENTRY 


RESULT 

DATA  PUNCHED  IN  A FORMAT-FREE  MANNER  IS  READ. 


USAGE 

1.  DATA  VALUES  ARE  PUNCHED  ON  CAROS  IMMEDIATELY  FOLLOWING 
8FNFREE  CARC. 

2.  EACH  DATA  VALUE  MAY  BE  PUNCHED  IN  EITHER  F-TYPE  OR 
E-TYPE  FORMAT  WITH  AS  MANY  DIGITS  AS  OESIREOt  BUT 
A DECIMAL  POINT  MUST  BE  PUNCHED. 

3.  DATA  VALUES  MUST  BE  SEPARATED  BY  BLANK  SPACES  OR  COMMAS. 

4.  AS  MANY  VALUES  PER  CARD  AS  DESIRED  MAY  BE  PUNCHEDt  BUT 
A DATA  VALUE  MUST  BE  PUNCHED  ENTIRELY  ON  ONE  CARD. 

5.  DATA  VALUES  FOR  EACH  OBSERVATION  MUST  BEGIN  ON  A NEW  CARO. 


B. 
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COMMENT  CARD 


1 8 

8CNT  AMY  COMMENT 


PARAMETER  FIELD  (COLUMNS  8>79» 
ANY  COMMENT 


30 


OUTPUT 

THE  COMMENT  IN  COLUMNS  8-79  WILL  BE  PRINTED  ON  THE  NEXT 
PAGE  OF  OUTPUT.  IF  TWO  OR  MORE  ACMT  CAROS  ARE  IN  SeOUENCe, 
ALL  COMMENTS  WILL  BE  PRINTED  ON  THE  NEXT  PAGE  USING  TRIPLE 
SPACING. 


USAGE 

COLUMN  80  MUST  NOT  BE  PUNCHED  ON  THE  ACNT  CARO. 


EXAMPLE 

COLUMN 
1 8 

ACMT  VARIABLE  3 IS  HEART  RATE. 


B 
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REUINO 

1 8 

BRNO  LFNANE 


PARAMETER  FIELD  (COLUMNS  8-141 

THE  LOCAL  FILE  NAME  OF  THE  FILE  TO  BE  REMOUNO 


RESULT 

THE  FILE  SPECIFIED  WILL  BE  REWOUND.  \ 


USAGE 

1.  ONLY  ONE  FILE  HAY  BE  REWCUNO  WITH  ANY  ONE  BRNO  CARO. 

2.  THE  LOCAL  FILE  NAMED  MUST  REFER  TO  THE  FILE  IN 
CURRENT  USE. 

3.  DO  NOT  REMIND  TAPES  OR  TAPE4. 


COMMENTS 

8 

MYOATA  MVOATA  WILL  BE  REWOUND. 

MYTAPE  MYTAPE  WILL  BE  REWOUND. 


EXAMPLES 

CAROS 

1 

BRNO 

BRWO 


I 


a 
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HISTOGRAM  SCALING 

1 B 

ASCALE  YltV2 


PARAMETER  FIELD  (COLUMNS  B-79) 
Yl  - MINIMUM 
Y2  - MAXIMUM 


RESULT 

SETS  THE  MINIMUM  AND  MAXIMUM  FOR  THE  BASE  AXISt  AND 
THEREBY  DETERMINES  THE  SCALE  FACTOR  FOR  THE  HISTOGRAM 
1UUTINE. 


USAGE 

1.  IF  THE  ASCALE  CARD  IS  USED,  IT  MUST  PRECEEO  THE  AHSTGM  CARO, 

2,  ABSENCE  OF  THE  ASCALE  CARO  WILL  CAUSE  AUTO-SCALING. 


EXAMPLES 
CARDS 
1 8 
ASCALE  20,  160 
ASCALE  10.3,  100.3 


COMMENTS 

SCALE  FROM  20  TO  160. 
SCALE  FROM  10.3  TO  100.3. 


DESCRIPTION  OF  INFORHATIQN  CONTROL  CAROS 


PLOT  SCALING 


1 8 

tSCALE  VltV2tXltX2 


PAIUNETER  FIELD  (COLUMNS  8-791 

Yl  - MINIMUM  FOR  THE  DEPENDENT  VARIABLE  (Y-AXISi 

Y2  - MAXIMUM  FOR  THE  DEPENDENT  VARIABLE  (Y-AXIS) 

XI  - MINIMUM  FOR  THE  INDEPENDENT  VARIABLE  (X-AXIS) 

X2  - MAXIMUM  FOR  THE  INDEPENDENT  VARIABLE  (X-AXIS) 


RESULT 

SETS  THE  MINIMUM  AND  MAXIMUM  FOR  THE  X AND/OR  Y AXISt 
AND  THEREBY  DETERMINES  THE  SCALE  FACTORS  FOR  THE  PLOT 
ROUTINES. 


USAGE 

1.  WHEN  USED*  THE  ASCALE  CARD  MUST  PRECEEO  EITHER  THE 
SPLOTt  APLOTNt  OR  APLOTI  CARO. 

2.  ABSENCE  OF  THE  ASCALE  CARO  WILL  CAUSE  AUTO-SCALING. 

3.  IF  ONLY  ONE  AXIS  IS  TO  BE  MANUALLY  SCALED*  THEN  THE 
MINIMUM  AND  MAXIMUM  VALUES  OF  THE  AUTO  SCALED  AXIS 
MUST  BOTH  BE  SET  TO  ZERO. 


EXAMPLES 
CAROS 
L 8 

ASCALE  0*0,10*300 
ASCALE  30*100*4.3*6.9 


COMMENTS 

AUTO  SCALING  ON  THE  Y-AXIS* 

SCALING  FROM  10  TO  300  ON  THE  X-AXIS 
SCALING  FROM  30  TO  100  ON  THE  Y-AXIS 
SCALING  FROM  4.3  TO  6.9  ON  THE  X-AXI 
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COMBINING  VARIABLES  - SAOO 

1  8 

$AOO  I1*I2«I9 


PARAMETER  FIELD  I COLUMNS  6-79) 

11  - VARIABLE  WHICH  IS  TO  BE  ON  THE  TOP 

12  - VARIABLE  WHICH  IS  TO  BE  ON  THE  BOTTOM 

13  - CREATED  NEW  VARIABLE 


RESULT 

TWO  VARlABLESt  II  AND  12t  ARE  VERTICALLY  JOINED  IN  A NEW 
VARIABLEt  13. 


USAGE 

1.  THE  BOTTOM  VARIABLE*  12*  MAY  CONTAIN  NO  MORE  THAN 
200  OBSERVATIONS*  (CASES)*  IGNORING  BLANKS. 

2.  OLD  VARIABLES  CAN  AND  WILL  BE  REWRITTEN  WITH  INCORRECT 
USE  OF  13. 


EXAMPLE 

SUPPOSE  THERE  ARE  2 VARIABLES  WITH  THE  VALUES 

1 5 

2 6 

3 7 

4 8 

SAOO  1*2*3  WOULD  RESULT  IN  THE  CREATION  OF  A 3RD  VARIABLE 
WITH  VALUES 

1 

2 

3 

4 

5 

6 

7 

8 


B 
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END 


1 8 
BEND 


PARAMETER  FIELD 

NO  ENTRY 


RESULT 

SEND  INDICATES  THAT  THE  CURRENT  GRASS  RUN  IS  FINISHED. 


USAGE 

THE  SEND  CARO  MUST  BE  THE  LAST  GRASS  CONTROL  CARO. 


I 
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( 

1 


BASIC  STATISTICS 


SBSTAT 


PARAMETER  FIELD  (COLUMNS  8-791 
11 
12 

• • 

..  INPUT  VARIABLES 

• • 

ISO 


OUTPUT 

1.  VARIABLES 

2.  OBSERVATIONS 

3.  MINIMUM 

4.  maximum 

5.  RANGE 

6.  MEDIAN 

7.  MEAN 

8.  STANDARD  DEVIATION 

9.  ST.  ERROR  OF  THE  MEAN 


USAGE 

MAXIMUM  OF  50  VARIABLES 

NO  LIMIT  ON  NUMBER  OF  OBSERVATIONS 

THE  MEDIAN  WILL  NOT  BE  CALCULATED  IF  CBS ERVAT I0NS>500. 


EXAMPLE 

CAROS  COMMENTS 

1 8 

ABSTAT  ^t5f9,10  BASIC  STATISTICS  FOR  VARIABLES  4»5t9  AND  10 

ABSTAT  ALL  BASIC  STATISTICS  FOR  ALL  VARIABLES 


j 


i 


] 


c 
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FOR  EACH  VARIABLE  REQUESTED  THE  BSTAT  ROUTINE  COMPUTES  A VARIETY 
OF  DESCRIPTIVE  STATISTICS.  THESE  INCLUDE 

1.  MINIMUM  - THE  OBSERVATION  WITH  THE  SMALLEST  VALUE 

2.  MAXIMUM  - THE  OBSERVATION  WITH  THE  LARGEST  VALUE 

3.  RANGE  - THE  MAXIMUM  VALUE  MINUS  THE  MINIMUM  VALUE 

4.  MEDIAN  - THE  VALUE  FOR  WHICH  50->  OF  THE  VALUES  ARE  GREATER  AND 
50'<  ARE  LESS.  FOR  AN  ODD  NUMBER  OF  OBSERVATIONS*  THE  MEDIAN  IS  THE 
MIDDLE  value  in  A SEQUENTIALLY  ORDERED  SET.  FOR  AN  EVEN  NUMBER  OF 
OBSERVATIONS.  THE  MEDIAN  IS  THE  AVERAGE  OF  THE  TWO  MIDDLE  VALUES. 

5.  MEAN  - THE  AVERAGE  OF  THE  CBSERVATI3NS 


6.  STANDARD  DEVIATION  - THE  STANDARD  DEVIATION  IS  DEFINED  AS  THE 

SQUARE  ROOT  OF  THE  SUM  OF  THE  SQUARED  DEVIATION  OF  EACH  OBSERVATION 
FROM  THE  MEAN  DIVIDED  BY  THE  NUMBER  OF  OBSERVATIONS  LESS  1. 


SC=  SQRTISUMI (X( I)-MEANX)t2I/IN-ll  1 


WHERE*  XII)  IS  AN  OBSERVATION*  McANX  IS  THE  OVERALL  MEAN*  t MEANS 
RAISED  TO  THE  FOLLOmING  POWER*  N IS  THE  NUMBER  OF  OBSERVATIONS  AND  SUM 
INDICATES  THE  SUM  OVER  ALL  OBSERVATIONS.  THE  STANDARD  DEVIATION  IS 
COMPUTED  BY  THE  FORMULA 

SUMIXI I)t2)-l (SUM  X(I))t2]/N 


V 


N-1 


V IS  THE  VARIANCE. 


SO  > SQRT(V) 


7.  STANDARD  ERROR  OF  THE  MEAN  - STO.  ERROR  > SO/SQRTIN)  * WHERE 

SO  IS  THE  STANDARD  DEVIATION  AND  N IS  THE  NUMBER  OF  OBSERVATIONS. 


c 
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NON>PAIREO  T-TEST 


1 8 

8NTST  Il»I2t...,I50 


PARAMETER  PIELO  COLUMNS (8-79) 

11 

12 


INPUT  VARIABLES 


• • 

ISO 


OUTPUT 

1.  MEANS#  STANCARO  DEVIATIONS#  AND  NUMBER  OF  OBSERVATIONS# 
(CASES)#  FOR  ALL  INPUT  VARIABLES  ARE  PRINTED. 

2.  T-VALUES#  DEGREES  OF  FREEDOM#  AND  LEVELS  OF  SIGNIFICANCE  OF 
T-VALUES  FOR  ALL  PAIRS  OF  INPUT  VARIABLES  ARE  PRINTED. 


USAGE 

A MAXIMUM  OF  SO  VARIABLES  MAY  BE  USED. 

THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 


CAROS 

1 

b 

COMMENTS 

8NTST 

3.5 

A NON-PAIREO  T-TEST  BETWEEN 
VARIABLES  3 AND  5 WILL  BE  PERFORMED. 

8HTST 

ALL 

NON-PAIREO  T-TESTS  BETWEEN  ALL 
PAIRS  OF  INPUT  VARIABLES  WILL  BE 
PERFORMED. 

c 
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IN  GENERAL.  THE  NON-PAIREO  T>TeST  IS  USED  TO  TEST  HYPOTHESES 
CONCERNING  THE  MEANS  OF  2 POPULATIONS.  THE  USUAL  HYPOTHESIS  IS  THAT 
THE  MEANS  OF  THE  2 POULATIONS  ARE  EQUAL.  A RANDOM  SAMPLE  IS  CHOSEN 
FROM  EACH  OF  THE  2 POPULATIONS.  THE  SAMPLE  MEANS  ARE  USED  TO  MAKE 
INFERENCES  ABOUT  THE  POPULATION  MEANS.  IT  IS  APPROPRIATE  TO  USE  THE 
NON-PAIREO  T-TEST  TO  HELP  MAKE  THESE  INFERENCES  WHEN  THE  DESIGN  OF  THE 
EXPERIMENT  INCLUDES  SAMPLING  FROM  ONLY  2 POPULATIONS.  IF  INFERENCES 
ARE  TO  BE  MADE  ABOUT  THE  MEANS  OF  MORE  THAN  2 POPULATIONS.  THE 
ANALYSIS  OF  VARIANCE  OR  THE  KRUSKAL'WALL I S TEST  SHOULD  BE  USED, 

A TEST  STATISTIC.  T.  IS  CALCULATED  WHICH  MAY  BE  USED  TO  TEST  THE 
HYPOTHESIS  THAT  THE  TRUE  MEAN  OF  POPULATION  1.  MUl.  EQUALS  THE  TRUE 
MEAN  OF  POPULATION  2.  MU2.  THE  TEST  STATISTIC  IS  CALCULATED  AS 

T«  <MX1-MX2I/SQRT(V/N1  ♦ V/N2) 

WHERE  MX1.MX2  ARE  THE  MEANS  OF  SAMPLES  1 AND  2. 

N1.N2  ARE  THE  NO.  OF  OBSERVATIONS  IN  SAMPLES  I AND  2. 

AND  V IS  THE  POOLED  VARIANCE  OF  THE  2 SAMPLES. 

THE  FORMULA  FOR  -V-  IS 


V«  [(Nl-llVl  ♦ (N2-1IV21/N14N2-2 

WHERE  VL  AND  V2  ARE  THE  VARIANCES  OF  SAMPLES  1 AND  2.  T HAS  DEGREES 
OF  FREEDOM  (O.F.)  > Nl«N2-2.  THE  LEVEL  OF  SIGNIFICANCE.  ALPHA, 

PRINTED  IN  THE  OUTPUT  IS  EQUAL  TO  TWICE  THE  PROBABILITY  OF  FINDING  A 
RANDOM  VALUE  OF  T GREATER  THAN  THE  ABSOLUTE  VALUE  OF  THE  OBTAINED  T. 

IN  OTHER  WORDS.  ALPHA  IS  THE  SMALLEST  SIGNIFICANCE  LEVEL  AT  WHICH  THE 
HYPOTHESIS  MU1«MU2  COULD  BE  REJECTED  IN  FAVOR  OF  THE  TWO-SIOEO 
alternative  HYPOTHESIS.  MU1AMU2.  FOR  THE  SMALLEST  SIGNIFICANCE  LEVEL 
FOR  WHICH  THE  HYPOTHESIS  MU1»MU2  COULD  BE  REJECTED  IN  FAVOR  OF  EITHER 
OF  THE  ONE-SIDED  ALTERNATE  HYPOTHESES .MUKMU2  OR  NU1>MU2»  USE  ALPHA/2. 


REFERENCE 

STEELE.  R.G.D.  AND  J.H.  TORRIE  (L960t  PRINCIPLES  AND  PROCEDURES  OF 
STATISTICS  MCGRAW-HILL.  NEW  YORK.  PP  73-78. 


c 
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MIRES  T-fEST 

1 8 
8RTST 


RARANETER  FIELD  tCOLUMNS  8-79) 

11 

12 

• • 

..  INRUT  VARIABLES 
• • 

125 


OUTPUT 

1.  MEANS t STANCARO  DEV lAT IONS t AND  NUMBERS  OF  OBSERVATIONS 
ARE  PRINTED  FOR  ALL  INPUT  VARIABLES. 

2.  T-VALUES.  DEGREES  OF  FREEDOM,  AND  LEVELS  OF  SIGNIFICANCE  OF 
T-VALUES  ARE  PRINTED  FOR  ALL  PAIRS  OF  INPUT  VARIABLES. 


USAGE 

A MAXIMUM  OF  25  VARIABLES  MAY  BE  USED. 

THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS,  ICASES). 


EXAMPLES 


CAROS 

8 

♦ ,6 

COMMENTS 

8PTST 

A PAIRED  T-TEST 
4 AND  6 MILL  BE 

BETWEEN  VARIABLES 
PERFORMED. 

$PTST 

ALL 

PAIRED  T-TESTS 
INPUT  VARIABLES 

BETWEEN  ALL  PAIRS  OF 
WILL  BE  PERFORMED. 

k.  _ A 


c 
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[ PAIRED  T-TESTS  ARE  USED  TO  TEST  THE  HYPOTHESIS  THAT  THE  MEAN  OF  THE 

i DIFFERENCES*  NUOt  BETWEEN  TWO  POPULATIONS  IS  0.  THE  SAMPLES  MUST  BE 

I OAIRED.  SCME  EXAMPLES  OF  PAIRED  SAMPLES  ARE  BEFORE  AND  AFTER 

[TREATMENTS  ON  THE  SAME  SUBJECT  OR  SAMPLES  IN  WHICH  SUBJECTS  ARE  PAIRED 
ON  SOME  CRITERIA  OTHER  THAN  THAT  BEING  MEASURED.  THE  DECISICION  OF 
whether  OR  NOT  TO  USE  PAIRING  MUST  BE  MADE  BEFORE  THE  EXPERIMENT  IS 
CONDUCTED.  PAIRED  T-TESTS  SHOULD  NCT  BE  USED  UNLESS  THE  SAMPLES  WERE 
IN  FACT  PAIRED. 

1 the  test  STATISTIC*  T*  IS  DEFINED  AS 

i T«  NO/SQRTIVO/NI 


WHERE  MD  IS  THE  AVERAGE  OF  THE  DIFFERENCES  BETWEEN  SAMPLES  1 AND 
VO  IS  THE  VARIANCE  OF  THE  DIFFERENCES* 

AND  N IS  THE  NUMBER  OF  PAIRS  OR  DIFFERENCES, 
r HAS  DEGREES  OF  FREEDOM  ID.F.I  EQUAL  TO  N-1. 


2* 


TWICE  THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  T GREATER  THAN 
THE  ABSOLUTE  VALUE  OF  THE  OBTAINED  T VALUE  IS  PRINTED  AS  THE  LEVEL  OF 
SIGNIFICANCE*  ALPHA.  THIS  IS  THE  SMALLEST  SIGNIFICANCE  LEVEL  AT  WHICH 
THE  HYPOTHESIS*  NU0«0*  MAY  BE  REJECTED  IN  FAVOR  OF  THE  TWO-SIOED 
ALTERNATE  HYPOTHESIS  NU0«0.  FOR  THE  SMALLEST  SIGNIFICANCE  LEVEL  AT 
WHICH  THE  HYPOTHESIS  NU0«0  COULD  BE  REJECTED  IN  FAVOR  OF  EITHER  OF  THE 
ONE-SIDED  ALTERNATE  HYPOTHESES*  NU0<0  OR  NU0>0.  USE  ALPHA/2. 


REFERENCE 

STEELE*  R.G.D.  AND  J.H.  TORRIE  11960)  PRINCIPLES  AND  PROCEDURES  OF 
STATISTICS.  MCGRAW-HILL*  NEW  YORK.  PP  7B-80. 


J 


c 
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WlLCOXONtS  RANK  SUN  TEST 


1 8 
SRNRSN  11.12 


PARAMETER  FIELD  (COLUMNS  8-79) 

11  - THE  FIRST  VARIABLE 

12  - THE  SECOND  VARIABLE 


OUTPUT 

1.  THE  RELATIVE  RANKS  OF  THE  SECOND  INPUT  VARIABLE*  12* 

AND  THE  SUM  OF  THOSE  RANKS*  THE  WILCOXON  STATISTIC*  M* 

ARE  PRINTED. 

2.  WHEN  THE  TOTAL  NUMBER  OF  OBSERVATIONS  IN  II  AND  12  IS 

LESS  THAN  16*  UPPER  TAIL  PROBABILITIES  FOR  AND  FOR 

«N(M»N4>1I-W*  ARE  PRINTED  WHERE  M>  NO.  OF  OBSERVATIONS 
OF  THE  FIRST  VARIABLE*  II*  AND  N«  NO.  OF  OBSERVATIONS 
OF  THE  SECOND  VARIABLE*  12. 

3.  THE  LARGE  SAMPLE  APPROXIMATION  TO  -W-  AND  THE  LEVEL 

OF  SIGNIFICANCE  OF  THE  LARGE  SAMPLE  APPROXIMATION  ARE  PRINTED. 

4.  THE  RELATIVE  RANKS  QF  THE  SECOND  INPUT  VARIABLE*  12* 

FOLLOWING  A RERANKING  FOR  THE  SIEGEL-TUKEY  TEST  ARE  OUTPUT. 

THE  SUM  OF  THESE  RANKS  IS  ALSO  OUTPUT. 


USAGE 

1.  TO  AVOID  DIFFICULTIES  IN  THE  INTERPRETATION  OF  THE  OUTPUT* 
THE  VARIABLE  WITH  THE  LESSER  NUMBER  OF  OBSERVATIONS  SHOULD 
BE  INPUT  AS  THE  SECOND  VARIABLE*  12. 

2.  THE  TOTAL  NUMBER  OF  OBSERVATIONS  MAY  NOT  EXCEED  1000*  AND 
EACH  VARIABLE  MAY  CONTAIN  NO  MORE  THAN  500  OBSERVATIONS. 


EXAMPLE 

CARD  COMMENTS 

1 8 

SRNKSM  9*7  WILCQXONtS  RANK  SUM  TEST  WILL  BE 

PERFORMED  FOR  VARIABLES  9 AND  7. 


c 
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I.  THE  WILCOXON  RANK  SUM  STATISTIC  IS  USED  TC  TEST  HYPOTHESES  ABOUT 
THE  DIFFERENCE  BETWEEN  THE  MEDIANS  OF  TWO  POPULATIONS.  IN  PARTICULAR, 
THE  NULL  HYPOTHESIS  IS  NEOIP1I>NE01P2I>OELTA«0*  WHERE  MEDIPII  AND 
MEOIP2)  ARE  THE  MEDIANS  OF  POPULATIONS  I AND  2. 


SUPPOSE  WE  HAVE  OBTAINED  2 SAMPLES  OF  K«N«N  OBSERVATIONS,  XI,..., 
XN  ANO  VI,...,  VN.  FIRST  THE  K CBSERVATICNS  ARE  ORDERED  FROM  LEAST  TO 
GREATEST.  THE  ORDERED  OBSERVATIONS  ARE  ASSIGNED  RANKS  I TO  K.  TIED 
OBSERVATIONS  ARE  ASSIGNED  THE  AVERAGE  RANK  OF  THE  TIED  OBSERVATIONS. 
THE  SUM  OF  THE  N RANKS  OF  THE  V VARIABLE  IS  THE  RANK  SUM  STATISTIC,  M. 
THE  SMALLEST  SIGNIFICANCE  LEVEL,  ALPHAl,  AT  WHICH  THE  HYPOTHESIS, 
NE0frVI*ME0IPX)«0ELTA>0,  COULD  BE  REJECTED  IN  FAVOR  OF  THE  ALTERNATE 
HYPOTHESIS,  DELTA>0  IS  PRINTED  AS  THE  UPPER  TAIL  PROBABILITY  OF  M. 

THE  SMALLEST  SIGNIFICANCE,  ALPHA2,  AT  WHICH  THE  HYPOTHESIS,  DELTAS 
COULD  BE  REJECTED  IN  FAVOR  OF  THE  ALTERNATE  HYPOTHESIS  0ELTA<0  IS 
PRINTED  AS  THE  UPPER  TAIL  PROBABILITY  OF  !•«.  FOR  THE  LEVEL  OF 

SIGNIFICANCE  AGAINST  THE  TWO-SIDED  ALTERNATE  HYPOTHESIS,  DELTAPO, 
CALCULATE  ALPHA«ALPNA1»I1>ALPHA2I  IF  ALPHA2>ALPHA1  OR 
ALPHA«U-ALPNA1I«’ALPNA2  IF  ALPHA1>ALPHA2..  IF  If  IS  NOT  A WHOLE 
NUMBER,  IT  IS  ROUNDED  TO  THE  NEXT  HIGHEST  WHOLE  NUMBER  FOR  CALCULATION 
OF  ALPHAl,  ANO  NIN*IF»1I-H  IS  ROUNDED  TO  THE  NEXT  LOWEST  WHOLE  NUMBER 
FOR  CALCULATION  OF  ALPNA2. 

FOR  EASE  OF  INTERPRETATION  ALWAYS  USE  NAN.  THIS  CAN  BE 
ACCOMPLISHED  BY  DESIGNATING  THE  VARIABLE  WITH  THE  SMALLER  NUMBER  OF 
OBSERVATIONS  AS  THE  SECOND  INPUT  VARIABLE,  12. 

ALL  OUTPUT  INCLUDES  THE  LARGE  SAMPLE  APPROXIMATION  TO  M,  HA.  FOR 
LARGE  K,  MA  FOLLOWS  A STANDARD  NORMAL  DISTRIBUTION.  THE  GENERAL 
FORMULA  FOR  NA  IS 

M-[|N»MAll/2] 


IN  GENERAL, 


SQRTfVARIMH 


VANCNI«INN/12I1NaNa1  - SUNtTIJII(TIJlV2-ll/IN«NICflANAil] 


WHERE  TCJ|«  THE  NUMBER  OF  OBSERVATIONS  IN  THE  JTH  GROUP  OF  TIED 
OBSERVATIONS  ANO  SUN  INDICATES  SUMMING  ITIJIIITI Jlt2»AI  OVER  ALL  2 
GROUPS.  FOR  UNTIED  OBSERVATIONS,  WHEN  THERE  ARE  NO  TIED 

OBSERVATIONS,  VARIMI  REOXES  TO 

VARCHI  > NNINAMAll/12 


ANO 


H-CNCNaNaII/2] 

WA  

SQRTCNN(NANAii/12l 

THE  LEVEL  OF  SIGNIFICANCE  OF  MA  IS  THE  PROBABILITY  OF  FINDING  A RANDOM 
VALUE  OF  NA  GREATER  THAN  THE  OBTAINED  VALUE  OF  NA. 


THE  WILCOXON  RANK  SUN  TEST  MAY  BE  USED  INSTEAD  OF  THE  UNPAIRED 
T-TEST  TO  DISCERN  INFORMATION  ABOUT  THE  EQUALITY  OF  THE  CENTERS  OF  TWO 
POPULATIONS. 


II.  THE  SIEGEL-TUKEY  TEST  MAY  BE  USED  TO  TEST  HYPOTHESES  ABOUT  THE 
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DIFFERENCES  IN  THE  SPREAD  OR  VARIANCES  OF  TWO  POPULATIONS.  FOR 
EXAMPLE.  IT  MIGHT  BE  USED  TO  TEST  THE  HYPOTHESIS  THAT  THE  PRECISION  OF 
TMO  MEASURING  INSTRUMENTS  IS  THE  SAME.  IN  GENERAL.  THE  NULL 
HYPOTHESIS  IS  VARCPVl-VARIPXI.  THE  K»N^N  OBSERVATIONS  ARE  CROEREO 
FROM  least  TO  GREATEST.  RANKS  ARE  THEN  ASSIGNED  BY  GIVING  RANK  1 TO 
THE  SMALLEST  OBSERVATION.  RANK  2 TC  THE  LARGEST.  RANK  3 TO  THE  SECOND 
LARGEST.  RANK  4 TO  THE  SECOND  SALLEST  AND  SO  ON.  THE  ORDERED  N«N 
OBSERVATIONS  THEN  HAVE  RANKS 

1.4.5.B.9.....(H>NI.....7.6.3.2. 

TIED  OBSERVATIONS  ARE  ASSIGNED  THE  AVERAGE  RANK  OF  THE  TIED 
OBSERVATIONS.  AS  IN  THE  RANK  SUM  TEST.  THE  SUM  OF  THE  N RANKS  OF  THE 
V VARIABLE  IS  THE  TEST  STATISTIC.  M.  A PRIMARY  ASSUMPTION  OF  THE 
SIEGEL-TUKEY  TEST  IS  THAT  THE  CENTERS  OF  THE  TWO  POPULATIONS  ARE 
COINCIDENT.  IF  THERE  IS  REASON  TO  BELIEVE  THAT  THE  CENTERS  ARE  NOT 
COINCIDENT.  A TRANSFORMATION  MAY  BE  PERFORMED  ON  THE  OBSERVATIONS  TO 
EQUALIZE  THE  MEDIANS.  CAUTION  SHOULD  BE  USED  SINCE  ADJUSTMENT  OF  THE 
DATA  ALTERS  THE  NULL  DISTRIBUTION  OF  W.  IT  IS  PROBABLY  BETTER  TO  USE 
THE  F'TEST  FOR  COMPARISONS  OF  2 VARIANCES  WHEN  THERE  IS  REASON  TO 
BELIEVE  THAT  THE  POPULATIONS  DO  NOT  HAVE  THE  SANE  CENTER. 


REFERENCES 

HOLLANDER.  M.  AND  O.A.  WOLFE  (1973)  NON-PARANETR IC  STATISTICAL 
METHODS.  WILEY.  NEW  YORK.  PP  67-75 

CONOVER.  W.J.  (1971)  PRACTICAL  NONPARAMETRIC  STATISTICS.  WILEY.  NEW 

YORK. 
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WILCOXONTS  SIGNED  RANKS  TEST 


1 8 

tSNRNK  lltl2 


PARAMETER  FIELD  (COLUMNS  8-79) 

11 

INPUT  VARIABLES  TO  8£  COMPARED 

12 


OUTPUT 

1.  THE  RANKS  OF  THE  ABSOLUTE  VALUES  OF  THE  DIFFERENCES 
BETWEEN  THE  PAIRED  VARIABLES  ARE  PRINTED. 

2.  THE  SUM  OF  THE  RANKS  CORRESPONDING  TO  POSITIVE  DIFFERENCESt  T*t 

THE  SUM  OF  THE  RANKS  CORRESPONDING  TO  NEGATIVE  OIFFERENCES»  T-f 

AND  THE  MINIMUM  OF  THESE  TWO  SUMS.  T,  ARE  OUTPUT.  THE  LEVEL 

OF  SIGNIFICANCE  OF  T I S ALSO  OUTPUT. 

3.  THE  LARGE  SAMPLE  APPROXIMATION  OF  -T-  AND  THE  LEVEL  OF 
SIGNIFICANCE  OF  THE  LARGE  SAMPLE  APPROXIMATION  ARE  PRINTED. 


USAGE 

1.  A MAXIMUM  OF  500  OBSERVATIONS.  DIFFERENCES.  MAY  BE  USED. 

2.  THE  NUMBER  OF  OBSERVATIONS  FOR  THE  TWO  INPUT  VARIABLES 
MUST  BE  EQUAL. 


EXAMPLE 

CARD  COMMENTS 

1 8 

8SNRNK  3.5  A SIGNED  RANKS  TEST  WILL  BE 

PERFORMED  FOR  VARIABLES  3 AND  5. 


c 


STATISTICAL  ROUTINE  SPECIFICATION  CAROS 


46 


rilLCOXONtS  SIGNED  RANK  TEST  IS  USED  TO  TEST  HYPOTHESES  ABOUT  THE 
MEDIAN  OF  THE  DIFFERENCES  OF  2 PAIRED  POPULATIONS. 

CONSIDER  A SET  OF  PAIRED  OBSERYAT lONSt  CXltVll*  CX2tV2lt***tfXNt 
VNI.  FROM  THIS  SET  N DIFFERENCES  ARE  CALCULATED? 

01  = Xl-Yl 

02  =«  X2-Y2 


• • • 

ON  » XN-YN. 

THE  NULL  HYPOTHESIS  OF  NO  DIFFERENCES  BETWEEN  THE  PAIRED  OBSERVATIONS 
MAY  BE  FORMULATED  AS  ME0C0l«0»  WHERE  NEO(O)  IS  THE  MEDIAN  OF  THE 
DIFFERENCES.  ALTERNATIVE  HYPOTHESES  ARE  MEOIOlPOt  NE0f0l>0.  OR 
MEOIOKO. 

TO  TEST  THE  NULL  HYPOTHESIS.  THE  ABSOLUTE  VALUES  OF  THE 
DIFFERENCES  ARE  ORDERED  FROM  LEAST  TO  GREATEST.  RANKS  OF  1 TO  N ARE 
THEN  ASSIGNED  TO  THE  ORDERED  VALUES.  THE  SUM  OF  THE  RANKS  OF  THE 
POSITIVE  DIFFERENCES.  T*.  AND  THE  SUM  OF  THE  RANKS  OF  THE  NEGATIVE 
DIFFERENCES.  T-.  ARE  CCMPUTEO.  TNIN.  THE  MINIMUM  OF  J*  AND  T«*  MAY  BE 
USED  AS  THE  TEST  STATISTIC  WHEN  THE  ALTERNATE  HYPOTHESIS  IS  NEOIOlPO. 
THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  T LESS  THAN  OR  EQUAL  TO 
THE  OBTAINED  TNIN  IS  WRITTEN  AS  THE  LEVEL  OF  SIGNIFICANCE.  THIS  IS 
EQUIVALENT  TO  THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  T GREATER 
THAN  OR  EQUAL  TO  TMAX.  WHERE  TNAX  IS  THE  MAXIMUM  OF  AND  T*.  THUS 
TO  OBTAIN  THE  SMALLEST  SIGNIFICANCE  LEVEL  AT  WHICH  THE  HYPOTHESIS 
NE0CDI>0  MAY  BE  REJECTED  IN  FAVOR  OF  THE  ALTERNATE  HYPOTHESIS  NEOCOMO 
USE  TWICE  THE  VALUE  OF  THE  SIGNIFICANCE  LEVEL  PRINTED.  IF  THE 
ALTERNATE  HYPOTHESIS  OF  INTEREST  IS  NE0f0)>0.  FIND  THE  PROBABILITY  OF 
FINDING  A RANDOM  VALUE  OF  T GREATER  THAN  OR  EQUAL  TO  THE  OBTAINED 
IF  THE  ALTERNATE  HYPOTHESIS  IS  MEOfOKO.  FIND  THE  PROBABILITY  OF 
FINDING  A RANDOM  VALUE  OF  T GREATER  THAN  OR  EQUAL  TO  THE  OBTAINED 
VALUE  OF  T-.  THE  ABOVE  PROBABILITIES  MAY  BE  FOUND  USING  MSOST  IF  THE 
NUMBER  OF  DIFFERENCES  IS  LESS  THAN  OR  EQUAL  TO  30. 

ANY  PAIR  (XI. YI)  IS  EXCLUDED  FROM  CONSIDERATION  IF  XI«Y1.  AND  THE 
NUMBER  OF  OBSERVATIONS  IS  REDUCED  ACCORDINGLY.  IF  THERE  ARE  TIES 
AMONG  THE  ABSOLUTE  VALUES  OF  THE  DIFFERENCES.  THE  AVERAGE  RANK  OF  THE 
TIED  GROUP  IS  ASSIGNED. 

THE  LARGE  SAMPLE  APPROXIMATION  TO  TNIN  IS  CALCULATED  AS  FOLLOWS? 

T-CN(N«1I/4I 

SQNTIVARITII 

IN  GENERAL 

VARCTI  « IN(N«1M2N«1I/241  - I SUMCTI 1 1 1 CT(  I I-IIITI 11411/21 

WHERE  T IS  TNIN.  N IS  THE  NUMBER  CF  DIFFERENCES.  Till  IS  THE  NUMBER  OF 
OBSERVATIONS  IN  THE  ITH  TIED  GROUP.  AND  SUN  INDICATES  SUMMING  OVER  ALL 
I GROUPS.  FOR  AN  UNITED  OBSERVATION  Tfll>l.  WHEN  THERE  ARE  NO  TIES 
THE  LARGE  SAMPLE  APPROXIMATION  REDUCES  TO 

T*CNCN4ll/4) 

at  . 


SQRT I N I N4 1 } I 2N4 1 1 /24 1 
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FJR  LARGE  N*  AT  FCLLOWS  THc  STANDARD  NORMAL  DISTRIBUTION.  THE 
PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  AT  GREATER  THAN  THE  ABSOLUTE 
value  of  the  OBTAINED  AT  IS  ALSO  PRINTED. 


REFERENCES 

HOLLANOERf  M.  AND  C.  A.  WOLFE  (19731  NON-FARANETRIC  STATISTICAL 
METHODS.  WILEY,  NEW  YORK.  PP  27-33. 

NOETHER,  G.  (19711  INTRODUCTION  TO  STATISTICS?  A FRESH  APPROACH. 

HJUGHTON-MIFFLIN  CO.,  BOSTON.  PP  122-129. 
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ONE  WAV  ANALYSIS  OF  VARIANCE 


1 8 

SANOVA 


PARANETER  FIELD  (COLUMNS  8-79) 

11 

12 


• • 

..  INPUT  VARIABLES  (TREATMENTS) 
• • 

ISO 


OUTPUT 

1.  THE  NUMBERS  OF  OBSERVATIONS*  MEANS*  AND  STANDARD  DEVIATIONS 
ARE  PRINTED  FOR  EACH  VARIABLE,  TREATMENT*  SPECIFIED. 

2.  AN  ANALYSIS  OF  VARIANCE  TABLE  FOR  A ONE  WAY  CLASSIFICATION 
WITH  EQUAL  OR  UNEQUAL  NUMBERS  OF  REPLICATES  IS  PRINTED. 

THE  TABLE  INCLUDES  THE  BETWEEN  TREATMENTS,  WITHIN  TREATMENTS* 
AND  TOTAL  SUMS  OF  SQUARES  AND  DEGREES  OF  FREEDOM,  THE 
BETWEEN  TREATMENTS  AND  WITHIN  TREATMENTS  MEAN  SQUARES*  THE 
F-RATIO  OF  THESE  TWO  MEAN  SQUARES*  AND  THE  PROBABILITY*  P, 

OF  FINDING  A RANDOM  VALUE  OF  -F-  GREATER  THAN  THE  OBTAINED 
VALUE. 


USAGE 

1.  A MAXIMUM  OF  50  INPUT  VARIABLES  MAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 
CAROS 
1 8 
SANOVA  5,9*10 


SANOVA  ALL 


COMMENTS 

A ONE  WAY  ANALYSIS  OF  VARIANCE 
WITH  3 TREATMENTS*  VARIABLES 
5*  9*  AND  10*  WILL  BE  PERFORMED. 
A ONE  WAY  ANALYSIS  OF  VARIANCE 
WILL  BE  PERFORMED  USING  ALL 
INPUT  VARIABLES  AS  TREATMENTS. 
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IN  GENERALt  THE  ANALYSIS  OF  VARIANCE  IS  A METHOD  OF  PARTITIONING 
THE  OVERALL  VARIATION  IN  AN  EXPERIMENT  INTO  RECOGNIZED  SOURCES  OF 
VARIATION.  IN  A ONE-WAY  DESIGN  THE  TOTAL  VARIATION  IS  CONSIDERED  TO 
8E  EQUAL  TO  THE  SUM  OF  THE  VARIATION  BETWEEN  TREATMENTS  AND  THE 
VARIATION  ASSOCIATED  WITH  EXPERIMENTAL  ERROR,  (WITHIN  TREATMENTS!. 

IN  THE  ONE-WAY  DESIGN  WITH  K TREATMENTS  AND  R(I)  REPLICATES  IN 
THE  ITH  TREATMENT,  AN  OBSERVATION  CAN  BE  DESCRIBED  BY  THE  MODEL 

YdJI  * M ♦ Till  ♦ ECU) 

FOR  I * 1,...,K 

J = 1,...,R(I) 

WHERE 

Y(IJ)  IS  THE  JTH  OBSERVATION  IN  THE  ITH  TREATMENT, 

M IS  THE  OVERALL  MEAN, 

Tm  IS  THE  EFFECT  OF  THE  ITH  TREATMENT,  AND 
EIIJ!  IS  A RANDOM  ERROR  ASSOCIATED  WITH  THE  IJTH  OBSERVATION. 

IT  IS  ASSUMED  THAT  THE  SUM  OF  THE  TIDtS  IS  ZERO  AND  THAT  THE  EUJItS 
ARE  INDEPENDENT  AND  NORMALLY  DISTRIBUTED  WITH  MEAN  0 AND  AND  A COMMON 
VARIANCE.  (IT  SHOULD  BE  NOTED  THAT  THE  ASSUMPTIONS  ABOUT  THE  ElUlfS 
ARE  MORE  STRINGENT  IN  THIS  MODEL  THAN  IN  THE  CORRESPONDING  MODEL  FOR 
THE  KRUSKAL-WALLISt  TEST.) 

THE  NULL  HYPOTHESIS  TESTED  IS  ALL  T(I!*0  FOR  I>1,...,K.  THIS  IS 
EQUIVALENT  TO  THE  HYPOTHESIS  THAT  THE  MEANS  OF  ALL  TREATMENTS  ARE 
EQUAL.  THE  USUAL  ALTERNATE  HYPOTHESIS  IS  NOT  ALL  THIS  IS 

EQUIVALENT  TO  THE  STATEMENT  THAT  AT  LEAST  ONE  PAIR  OF  TREATMENT  MEANS 
ARE  NOT  EQUAL. 


THE  COMPLETE  ANALYSIS  OF  VARIANCE  TABLE  HAS  THE  FORM? 


* SOURCE 

* 

SUM  OF 

* 

DEGREES  OF 

* 

MEAN 

MM 

« 

F • 

* 

* 

SQUARES 

* 

FREEDOM 

* 

SQUARE 

• 

•TREATMENTS 

« 

SSTRT 

* 

K-l 

* 

MSTRT-SSTRT/K-l 

• 

MSTRT/MSE* 

* 

« 

• 

•WITHIN 

* 

* 

* 

* 

•TREATMENTS 

• 

SSE 

* 

SUMIR(I)-l) 

• 

MSE«SSE/SUM(RI Il-l) 

• 

* 

* 

* 

* 

* 

•TOTAL 

SST 

* 

N-1 

* 

• 

WHERE  K IS  THE  NUMBER  OF  TREATMENTS,  RID  IS  THE  NUMBER  OF  REPLICATES 
IN  THE  ITH  TREATMENT,  N IS  THE  TOTAL  NUMBER  OF  OBSERVATIONS,  AND  SUM 
INDICATES  SUMMING  OVER  ALL  TREATMENTS.  FOR  THE  DEFINITIONAL  AND 
COMPUTATIONAL  FORMULAE  OF  THE  SUMS  OF  SQUARES,  PLEASE  SEE  THE 
REFERENCE  LISTEO  BELOW. 

THE  OUTPUT  INCLUDES  THE  PROBABILITY,  P,  OF  FINDING  A RANDOM  VALUE 
OF  F LARGER  THAN  THE  OBTAINED  F.  THIS  IS  THE  SMALLEST  SIGNIFICANCE 
LEVEL  AT  WHICH  THE  NULL  HYPOTHESIS  ALL  Tlll«0  MAY  BE  REJECTED  IN  FAVOR 
OF  THE  ALTERNATE  HYPOTHESIS  NOT  ALL  Tfll«0. 
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REFERENCE 


STEELE*  R.  G.  0.  ANO  J.  H.  TORRIE  (19601  PRINCIPLES 
STATISTICS.  MCGRAU'HILL*  NEW  YORK.  PP  99-119. 


AND  PR^ICCOURCS  OF 
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k 


1 8 

80NNRT 


PARANETER  FIELD  {COLUMNS  8-791 

11 

12 

• • 

..  INPUT  VARIABLES 
• • 

ISO 


OUTPUT 

1.  THE  MEANS  OF  THE  SPECIFIED  VARIABLES  ARE  PRINTED. 

2.  THE  ERROR  MEAN  SQUARE  IS  PRINTED. 

3.  VARIABLES  ARE  TESTED  FOR  A SIGNIFICANT  DIFFERENCE 
BETWEEN  THE  MEANS  AT  THE  .05  LEVEL  {ALPHA  » .051. 


USAGE 

1.  A MAXIMUM  OF  50  VARIABLES  MAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 

CAROS  COMMENTS 

1 8 

80NMRT  1»3»6  THE  MEANS  OF  VARIABLES  It  3t  AND  5 ARE  COMPARED. 

80NMRT  ALL  THE  MEANS  OF  ALL  VARIABLES  ARE  COMPARED. 
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AS  MENTIONED  PREVIOUSLYf  A ONE-WAY  ANALYSIS  OF  VARIANCE  MAY  BE 
USED  TO  TEST  THE  HYPOTHESIS  THAT  THE  TREATMENT  MEANS  LN  A ONE-WAY 
DESIGN  ARE  ALL  EQUAL  AGAINST  THE  ALTERNATE  HYPOTHESIS  THAT  AT  LEAST 
ONE  PAIR  OF  TREATMENT  MEANS  ARE  NOT  EQUAL.  AN  INVESTIGATOR  MAY  BE 
INTERESTED  IN  MORE  SPECIFIC  INFORMATION  ABOUT  THE  TREATMENT  MEANS  THAN 
PROVIDED  BY  THE  ANALYSIS  OF  VARIANCE.  IT  MAY  BE  OF  INTEREST  TO  KNOW 
WHICH  TREATMENT  MEANS  DIFFER.  OUNCANtS  NEW  MULTIPLE  RANGE  TEST  IS  A 
METHOD  OF  COMPARING  ALL  PAIRWISE  COMBINATIONS  OF  TREATMENT  MEANS. 

CONSIDER  AN  EXPERIMENT  WITH  K TREATMENTS  AND  R REPLICATES  PER 
TREATMENT.  THE  TOTAL  NUMBER  OF  OBSERVATIONS  IS  N«RK.  THERE  ARE  THREE 
STEPS  INVOLVED  IN  OUNCANfS  NEW  MULTIPLE  RANGE  TEST. 

I.  SXBARt  THE  standard  ERROR  OF  THE  MEAN  IS  CALCULATED  AS 

SXBAR-SQRTC ERROR  MEAN  SQUARE/RI 

WHERE  R IS  the  NUMBER  OF  REPLICATES  PER  TREATMENT  AND  THE  ERROR  MEAN 
SQUARE  IS  THE  EQUIVALENT  OF  THE  WITHIN  TREATMENTS  MEAN  SQUARE  IN  THE 
ANALYSIS  OF  VARIANCE.  (NOTE  — WHEN  THERE  ARE  UNEQUAL  REPLICATES  PER 
TREATMENT  THE  HARMONIC  MEAN  OF  THE  NUMBERS  OF  REPLICATES  IS  USED 
INSTEAD  OF  R.)  SXOAR  HAS  R-K  DEGREES  OF  FREEDOM.  SIGNIFICANT  RANGES 
AT  THE  LEVEL  ARE  OBTAINED  FOR  THE  APPROPRIATE  DEGREES  OF  FREEOOMt 
N-K»  AND  FOR  THE  NUMBER  OF  MEANS  TO  BE  COMPAREOt  P«2t**.tX.  THESE 
RANGES  ARE  CONTAINED  IN  A TABLE  IN  THE  PROGRAM.  THE  APPROPRIATE 
RANGES  ARE  THEN  MULTIPLIED  BY  SXBAR  TO  GIVE  THE  CRITICAL  VALUE  WHICH  2 
MEANS  P STEPS  APART  ON  AN  ORDERED  SCALE  MUST  EXCEED  TO  BE  CONSIDERED 
DIFFERENT  AT  THE  S'.  LEVEL. 

II.  THE  TREATMENT  MEANS  ARE  RANKED  FROM  LEAST  TO  GREATEST. 

III.  THE  DIFFERENCES  BETWEEN  TREATMENT  MEANS  ARE  COMPARED  TO  THE 
CRITICAL  VALUES.  THE  MEANS  ARE  COMPARED  IN  THE  FOLLCWING  ORDER? 
LARGEST  MEAN  WITH  SMALLEST  MEANt  LARGEST  WITH  SECOND  SMAiLLEST....f 
LARGEST  WITH  SECOND  LARGEST t THEN  SECOND  LARGEST  WITH  SMALLEST*  AND  ON 
THROUGH  SECOND  SMALLEST  WITH  SMALLEST.  A DIFFERENCE  BETWEEN  TWO  MEANS 
IS  DEEMED  SIGNIFICANT  IF  IT  EXCEEDS  ITS  CRITICAL  VALUE  AND  IS  NOT 
WITHIN  A NON-SIGNIFICANT  RANGE. 

AN  EXAMPLE  MAY  HELP  TO  CLARIFY  THE  ABOVE.  SUPPOSE  WE  HAVE  FIVE 
TREATMENT  MEANS  A*  B*  C»  0*  AND  E.  IN  STEP  1*  4 CRITICAL  VALUES*  I* 
II*  III*  AND  IV*  WOULD  BE  CALCULATED  FOR  P (NUMBER  OF  STEPS  BETWEEN 
MEANS l«  2*  9*  4*  AND  5.  SUPPOSE  THE  MEANS  ARE  RANKED  FROM  SMALLEST  TO 
LARGEST  AS  FOLLOWS? 

B A E 0 C. 

THE  DIFFERENCE  C-B  WOULD  BE  CONSIDERED  FIRST.  C-B  WOULD  BE  CCMPARED 
TO  CRITICAL  VALUE  IV  SINCE  THE  RANGE  ENCOMPASSED  BY  C AND  B CONTAINS  5 
MEANS.  IF  C-B<IV»  WE  WOULD  CONCLUDE  THAT  ALL  THE  MEANS  ARE  THE  SAME 
AT  THE  .05  LEVEL.  HOWEVER*  IF  C-B>IV*  WE  WOULD  CONTINUE  BY  COMPARING 
C-A  WITH  CRITICAL  VALUE  III  AND  SO  ON  UNTIL  A NON-SIGNIFICANT  RANGE 
WAS  FOUND  OR  UNTIL  THE  LAST  COMPARISON*  A-B*  HAS  MADE. 

THE  RESULTS  ARE  SUMMARIZED  BY  UNDERSCORING  MEANS  WHICH  ARE  THE 
SAME  AT  THE  .05  LEVEL. 
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REFERENCE 

STEELEt  R.  G.  0*  AND  J.  F.  TORRIE  11960)  PRINCIPLES  AND  PROCEDURES  OF 
STATISTICS.  MCGRAW-HILL,  NEW  YORK,  PP  107-109. 
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NEWMAN  > KEULtS  MULTIPLE  RANGE  TEST 

1 8 

$NKNRT  IltI2f««115 


PARAMETER  FIELD  (COLUMNS  8-79) 

11 

12 

• • 

..  INPUT  VARIABLES 

• • 

115 


OUTPUT 

1.  MEANS  OF  THE  SPECIFIED  VARIABLES  ARE  PRINTED. 

2.  THE  ERROR  MEAN  SQUARE  IS  CCMPUTEO. 

3.  VARIABLES  ARE  TESTED  FOR  SIGNIFICANT  DIFFERENCES 
BETWEEN  THE  MEANS  AT  THE  .05  LEVEL  (ALPHA*. 05). 


USAGE 

1.  A MAXIMUM  OF  15  INPUT  VARIABLES  HAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 

CAROS  COMMENTS 

1 B 

SNKMRT  It20t21  THE  MEANS  CF  VARIABLES  It  20  AND  21 

ARE  COMPARED. 

SNKMRT  ALL  THE  MEANS  OF  ALL  VARIABLES  ARE  CCHPAREO. 


c 


STATISTICAL  ROUTINE  SPECIFICATION  CAROS 


55 


THE  PROCEDURES  INVOLVED  IN  PERFORMING  THE  NEWMAN-KEUL t S MULTIPLE 
RANGE  TEST  ARE  IDENTICAL  TO  THOSE  USED  IN  OUNCANtS  NEW  MULTIPLE  RANGE 
TEST  EXCEPT  THAT  DIFFERENT  RANGE  VALUES  ARE  USED.  THE  RANGE  VALUES 
FOR  THE  TWO  TESTS  ARE  IDENTICAL  WHEN  THE  NUMUER  OF  MEANS  BEING 
COMPARED  EGUALS  2.  WHEN  THE  NUMBER  OF  MEANS  IN  A RANGE  EXCEEDS  2* 
RANGE  VALUES  FOR  THE  NE WMAN-KEUL t S TEST  ARE  LARGER  THAN  CORRESPONDING 
RANGE  VALUES  FOR  OUNCANtS  TEST. 

REFERENCE 

STEELE*  R.  G.  0.  AND  J.  H.  TORRIE  (1960)  PRINCIPLES  AND  PROOEPURES  OF 
STATISTICS.  MCGRAW-HILL*  NEW  YORK.  PP  110-111. 


c 
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THE  KRUSKAL'UALLIS  STATISTIC 

1 8 

SKRUML  11.12. ••••ISO 


PARAMETER  FIEtO  I COLUMNS  8-79) 

11 

12 

• • 

..  INPUT  VARIABLES 

• • 

ISO 


OUTPUT 

U THE  RANK  OF  EACH  VALUE  IN  THE  DATA  SET  IS  PRINTED,  THE 
SUMS  OF  The  RANKS  FOR  EACH  VARIABLE  ARE  GIVEN.  TIES  ARE 
GIVEN  THEIR  AVERAGE  BANKING. 

2.  THE  KRUSKAL  - TALLIS  -H-  STATISTIC  AND  ITS  LEVEL  OF 
SIGNIFICANCE  ARE  PRINTED. 


USAGE 

1.  A MAXIMUM  OF  50  VARIABLES  MAY  BE  USED. 

2.  THF  TI.TAL  NUMBER  OF  CBSEBVATICNS  CANNOT  EXCEED  5000. 


EXAMPLES 

I 8 

CAROS 

SKRUWL  1.2.0 
$KRUML  ALL 


COMMENTS 

THE  H STATISTIC  IS  CALCULATED 
USING  VARIABLES  1.  2 AND  6. 
THE  H STATISTIC  IS  CALCULATED 
USING  ALL  VARIABLES. 


c 
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THE  KRUSKAL-wALLIS  TFiT  IS  Afj  EXTENSION  CF  WiLCCXCNtS  RANK  SUM 
TlST  from  2 INUcPtNOEM  GPOUPSt  TREATMcNTS  OR  CLASSIFICATIONS  TO  K 
IfiOEPENJENT  GROUPS.  THE  BASIC  MCOLL  IS 
Y(IJ»=  M ♦ T(J)  ♦ EIIJI 


FOR  I*It...tNlO) 

fK 


Y(IJ)  IS  THE  ITh  OBSERVATION  IN  THE  GRCUP  Jt 
■1  IS  THc  IjVlRALL  mean, 

TUI  IS  the  effect  uf  the  jth  treatment, 

tllJI  IS  A RANGCM  EKRCR  ASSOCIATED  WITH  THE  IJTH  OBSERVATION, 
MJ)  IS  THc  NC.  UF  REPLICATES  IN  TREATMENT  J. 


AND 


IT  IS  ASSUMED  THAT  THE  SUN  OF  THE  TIJItS  IS  ZERC,  AND  THAT  THE  EllJIfS 
ARE  MUTUALLY  INDEPENDENT  ANO  CCME  FROM  THE  SAME  CONTINUOUS  POPULATION. 

the  null  hypothesis  is  M07  Tm«TI2l«...>TC  Jl , AGAINST  THE 
ALTERNATIVE  THAT  NOT  ALL  THE  TIJItS  ARE  EQUAL. 


ALL  N=Nm+N(2»N ♦NIKI  OBSERVATIONS  ARE  ORDERED  FROM  LEAST  TO 

greatest  ANO  RANKS  1 TO  N ARE  ASSIGNED.  THE  SUM  OF  THE  RANKS  IS  THEN 
C.iMPUTEQ  FOR  EACH  TREATMENT  GROUP.  IF  RIIJ)  IS  THE  RANK  OF  YflJI, 
THEN  THE  SUM  OF  THE  RANKS  FOR  TREATMENT  J IS  RfJI*  SUNfRfIJII,  WHERE 
S J-4  INDICATES  SUMMING  OVER  1 FRCP  1 TO  NIJI.  THE  AVERAGE  RANK  FOR 
TREATMENT  J IS  HRIJI-RI Jl/Nf Jl . THE  OVERALL  AVERAGE  OF  THE  RANKS  IS 
MR=fNFll/2. 


THE  KRUSKAL-WALL  IS  STATISTIC  IS  OEFINED  AS 

H > I12/NfN»ll  lISUMINfJIlfNRUl  - NR|f2llt 

WHERE  SUM  INDICATES  SUMMING  OVER  J FROM  1 TC  K.  THIS  FORMULA  CAN  BE 
CONVERTED  TO  THE  CCMPUTATICNAL  FORMULA 

H « Ill2/NfN«lll(SUMffRf  JM2I/NIJII  l«3(NAll. 

fir  large  NIJI,  H FOLLOWS  AN  APPROXIMATE  CHI-SQUARE  DISTRIBUTION  WITH 
K-1  DEGREES  CF  FREEOCH.  THE  PROBABILITY  OF  FINDING  A LARGER  H BY 
CHANCE  ALONE  IS  GIVEN  AS  THE  LEVEL  GF  SIGNIGICANCE.  THIS  IS  THE 
LJWEST  SIGNIFICANCE  LEVEL  AT  WHICH  THE  NULL  HYPOTHESIS  MAY  BE  REJECTED 
IN  FAVUR  OF  THE  ALTERNATE  HYPOTHESIS. 


WHEN  THERE  ARE  TIED  OBSERVATIONS,  THE  AVERAGE  OF  THE  RANKS  OF  THE 
TIFU  VALUES  ARE  ASSIGNED  ANO  THE  STATISTIC 

H*  - H/Il-f ISUNISfL)«3-SlLII/INt3Hllll 

IS  UScO  INSTEAD  OF  H.  IN  THE  ABOVE  FORMULA,  SUM  INDICATES  SUMMING 
OVER  L FROM  1 TO  6 WHERE  6 IS  THE  NUMBER  OF  TIED  GROUPS,  ANO  SILI  IS 
THE  NUMBER  OF  TIED  VALUES  IN  GROUP  L.  FOR  UNTIED  OBSERVATIONS  SILI*1. 
FOR  DATA  WITH  ND  TIES  H*«H. 

FOR  SMALL  NIJI  THE  USER  IS  REFERRED  TO  TABLES  OF  THE 
KRUSKAL-WALL IS,  H,  STATISTIC  TO  DETERMINE  SIGNIFICANCE  OF  H.  THESE 
TABLES  ARE  TOG  LENGTHY  TO  BE  PRESENTED  HERE.  ONE  SOURCE  IS  TABLE  A. 7 
IN  HCLLANOER  ANO  WOLFE  (1973). 
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AEFERENCES 

HOLLANOERi  AND  0.  A.  WOLFE  (197J).  NON-PARANETRIC  STATISTICAL 
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MULTIPLE  COMPARISONS  BASED  ON  THE  KRUSKAL-UALLIS  H STATISTIC 

1 a 

SKRUWC  Il»I2f«tI50 


PARAMETER  FIELD  (COLUMNS  8-791 

11 

12 


INPUT  variables 


ISO 


OUTPUT 

1.  ALL  OUTPUT  GIVEN  BY  SKRUML. 

2.  THE  MULTIPLE  COMPARISON  PROCEDURE  INCLUDES  A LIST 
□F  THE  TREATMLNT,  VARIABLEt  PAIRS  COMPARED,  THE 
ABSOLUTE  DIFFERENCE  BETWEEN  THF  AVERAGE  RANKS  OF 
TREATMENT  PAIRS,  THE  TEST  STATISTICS,  2U,JltS,  AND 
THE  LEVELS  CF  SIGNIFICANCE  Of  THE  Z(I,J)*S. 


USAGE 

1.  A MAXIMUM  OF  50  VARIABLES  MAY  BE  USED. 

2.  THE  TOTAL  NUMBER  OF  CBSERVATICNS  CANNOT  EXCEED  5000. 


EXAMPLE 

CARO 

1 

iKRUHC 


COMMENTS 

8 

1,3,6  THE  H STATISTIC  IS  CALCULATED  USING 

VARlABLcS  1,  3 AND  6.  THE  DIFFERENCES 
OF  THE  AVERAGE  RANKS  CF  THE 
VARIABLE  PAIRS  (1,31,  (1,61 
AND  (3,6)  ARE  TESTED. 
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this  progra>»  calculates  the  kruskal>hallis  h statistic  as  in 
$KRuwL»  AND  p::mforms  all  pairnise  comparisons  between  the  average 

RANKS  OF  the  K TREATMENTS. 

IF  there  ARt  K TREATMENTS.  K(K-l)/2  COMPARISONS  ARE  PEKFORMEO. 
THE  STATISTICS  CALCULATEO  FCR  ALL  I.J,  AND  I/J  ARE 


AaSI  RbARin  - RoARIJ)  1 
SQRTlN(N-i)(l/Nm  ♦ 1/NIJ)1/121 


WHERE 

R5AR(II  AND  RBAR(J»  ARE  THE  AVERAGE  RANKS  OF  THE  ITH  AND  JTH 
TREATMENTS 

N a TOTAL  NUMBER  OF  CbSERVAT IONS 

Nin  AND  N(J)  = NUMBER  OF  OBSERVATIONS  IN  ITH  AND  JTH  TREATMENTS 
RESPECTIVELY. 

£(IJ)  FCLlOWS  a standard  NORMAL  OISTRIBLTION.  THE  LEVELS  OF 
SIGNIFICANCE  GIVEN  ON  THE  OUTPUT  ARE  THE  UPPER  TAIL  PROBABILITIES  FOR 
EACH  HU).  TO  MAKE  A DECISION  CONCERNING  ANY  GIVEN  PAIR,  CHOOSE  A 
DESIkcO  OVERALL  SIGNIFICANCE  LEVEL,  ALPHA.  THIS  WILL  INSURE  THAT  THE 
probability  will  be  AT  MUST  ALPHA  CF  DECLARING  THAT  TWO  OR  MORE 
TREATMENTS  DIFFER  WHEN  IN  FACT  ALL  K TREATMENTS  ARE  IDENTICAL.  THEN 
COMPUTE  C = ALPHA/KIK-1 ) . C CAN  THEN  BE  COMPARED  WITH  THE  LEVELS  OF 
SIGNIFICANCE  GIVEN  FCR  EACH  Z(IJ).  IF  THE  LEVEL  OF  SIGNIFICANCE  FOR  A 
PARTICULAR  ZIIJI  IS  LESS  THAN  Oft  EQUAL  TO  C,  WE  MAY  DECIDE  THAT  THE 
average  ranks  for  the  ITH  AND  JTH  TREATMENT  DIFFER  AT  THE  CHOSEN 
OVERALL  SIGNIFICANCE  LEVEL  ALPHA. 


REFERENCES? 

1.  NOETHER,  G.  (I9T1)  INTROOUCTION  TO  STATISTICS?  A fRESH  ARPROAOH. 

HOUGHTON-MIFFLIN  CO. , BCSTJN.  PP  1A7-IA8. 
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PEARSONtS  CORRELATION 


1 8 

$CORR  ll»12f.«I25 


PARAMETER  FIELD  ICOLUMNS  8-79) 
1 1 
IZ 

• • 

..  INPUT  variables 


OUTPUT 

1.  PEARSONtS  CCRRELATICN  CCFFFICIENTS  BETWEEN  ALL 
PAIRWISE  CGMBINATICNS  OF  INPUT  VARIABLES  AND 
CORPESPONOING  NUVtJERS  OF  C3SERVATIQNS  ARE  PRINTED. 

2.  THE  FOR.-)  OF  THF  OUTPUT  IS  ( CBSERVAT I CNS , CORRELATION). 


USAGE 

1.  A MAXINU^^  OF  25  VARIABLES  NAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 


CAROS 

a 

COMMENTS 

(CORK 

0 

1 f ^ 1 1)  t 

CGRRELATICNS 

OF 

VARIABLE  PAIRS 

(It^).  11.6) 

AND 

(4,6)  ARE  COMPUTED. 

SCORR 

ALL 

CORRELATIONS 

CF 

ALL  PAIRS  OF 

VARIABLES  ARF.  CALCULATED. 
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A CORRELATEON  COEFFICIENT  IS  A MEASURE  OF  THE  DEGREE  CF 
ASSOCIATION  BETWEEN  TWO  VARIABLES.  THERE  ARE  MANY  TYPES  OF 
CORRELATION  COEFEIC  lENTS . THE  ONE  CALCULATED  BY  THIS  PROGRAM  IS  THE 
PRODUCT  MOMENT  CORRELATION  COEFFICIENT*  R.  SPECIFICALLY,  R IS  A 
MEASURE  OF  THE  LINEAR  ASSOCIATION  BETWEEN  TWC  VARIABLES.  THE  VALUES 
OF  R CAN  RANGE  FROM  -1  (SMALL  VALUES  CF  ONE  VARIABLE  ARE  LINEARLY 
ASSOCIATED  WITH  LARGE  VALUES  OF  THE  OTHER  VARIABLE)  TO  ♦!  (VALUES  OF 
THE  TWO  VARIABLES  TEND  TO  INCREASE  TOGETHER).  A VALUE  OF  0 INDICATES 
NO  LINEAR  ASSOCIATION  BETWEEN  THE  VARIABLES. 

THE  PRODUCT  MOMENT  CORRELATION  IS  DEFINED  AS 


R 


SUMIIXI  D-XBARKYm-YBARl] 


S0RT((SUN(X(I)-XBAR)t21ISUM(Y(I)-Y8AR)«21] 


WHERE 

SUM  INDICATES  SUMMING  OVER  VALUES  OF  I > 1,2,...,N 

t IS  EXPONENTIATION 

X(I)  * THE  ITH  VALUE  OF  VARIABLE  X 

Y(I)  » THE  ITH  VALUE  CF  VARIABLE  Y 

XBAR  AND  YBAR  ARE  THE  SAMPLE  MEANS  CF  X AND  Y RESPECTIVELY 


THE  COMPUTING  AL^oORITHM  USED  IS 


R 


SUM  A8  - (SUM  A)  (SUM  B)/N 

SQKTtISUM( At2)-( (SUM  A ) t2 ) /N1( SUM( 3t 2)-( ( SUM  B)t2)/Nll 


WHERE 
A IS  X(  I) 

B IS  Y(I) 

N > NUMBERS  OF  X AND  Y VALUES 

F3R  TESTS  OF  HYPOTHESES  ABOUT  RHO*  THE  POPULATION  CORRELATION 
COEFFICIENT  (ESTIMATED  BY  R),  SEE  STEELE  AND  TORRIE  11960), 
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REFERENCES? 

STEELEt  R.G.O.  AND  J.H.  TORRIE  11960)  PKIMCIFLES  AMD  PROCEOURES  Of 
STATISTICS.  HCGRAS-HILLt  NEW  YORK.  PP  183-193. 
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KENOALLtS  RANK  CORRELATION 


1 8 

SRNKCR  Il»I2f««I25 


PARAMETER  FIELD  (CCLUMNS  0-791 
1 1 
12 

• • 

..  INPUT  VARIAbLES 

• • 

125 


OUTPUT 

1.  the  RANK  UF  EACH  VALUE  WITHIN  A VARIABLE  AND  THE 
VALUES  OF  EACH  VARIABLE  IN  A VARIABLE  PAIR  ARE 
PRINTED. 

2.  THE  -S-  STATISTIC,  KENOALLtS  CORRELATION  COEFFICIENT, 
(TAU),  THE  LARGE  SAMPLE  APPROXIMATION  TO  -S-,  AND  THE 
SIGNIFICANCE  LEVEL  OF  THE  LARGE  SAMOLE  APPROXIMATION 
ARE  GIVEN  FOR  ALL  PAIRWISE  COMBINATICNS  OF  THE  VARIABLES 
REQUESTED. 


USAGE 

1.  A MAXIMUM  OF  25  OBSERVATIONS  MAY  BE  USED. 

2.  THE  MAXIMUM  NUMBER  OF  OBSERVATIONS  PER  VARIABLE 
IS  50C. 


EXAMPLES 
CAROS 
1 B 

SRNKCR  1,4,6 

SRNKCR  ALL 


COMMENTS 

RANK  CORRELAITON  FOR  VARIABLE 
PAIRS  (1,4),  (1,6)  AND  (4,6) 

RANK  CORRELATION  FOR  ALL  VARIABLE 
PAIRS 
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RENOALLtS  RANK  CORRELATION  IS  A METHOD  CF  DISCERNING  THE 
ASSOCIATION.  IF  ANY.  UETWEEN  Th3  VARIABLES. 

SUPPOSE  WE  have  two  variables  with  N CBSERVATICNS  EACH?  (XIII. 
Ydll.  (X(2I.Y(2I)  .....(X(N).Y(NI).  THE  HYPOTHESIS  OF  INTEREST  IS  HO? 
X AND  Y ARE  INUEPENOENT  (UNASSOCIATED).  AN  INTUITIVE  APPROACH  TO 
DISCERNING  THE  RELATIONSHIP  BETWEEN  X AND  Y IS  TC  COUNT  HCw  CFTEN  THE 
RELATIVE  RANKINGS  WITHIN  EACH  VARIABLE  MOVE  IN  THE  SAME  DIRECTION  AND 
HOW  OFTEN  THEY  MOVE  IN  OPPOSITE  DIRECTIONS. 

IN  PARTICULAR  FOR  I AND  J=1.2.....N  AND  UJ.  IF  THE  PRODUCT 
rxm-X(J)llY(I)-Y(J)  I IS  GREATER  THAN  0.  (I.E.,  THE  OBSERVATIONS  ARE 
CHANGING  IN  THE  SAME  DIRECTION)  WE  CCUNT  ♦•I.  CONVERSELY.  IF  THE 
PRODUCT  IS  NEGATIVE.  WE  COUNT  -1.  IF  THE  PRODUCT  IS  0.  WE  COUNT  0. 
BECAUSE  AT  LEAST  ONE  CF  THE  VARIABLES  HAD  NO  CHANGE  BETWEEN 
OBSERVATIONS  1 AND  J. 

THE  SUM  OF  THE  KS,  -ItS.  AND  OtS  IS  THE  STATISTIC  S.  IF  ALL  N 
OBSERVATIONS  OF  X AND  Y MOVED  IN  THE  SAME  DIRECTION.  S WOULD  EQUAL 
NlN-D/2.  WHILE  IF  ALL  OBSERVATIONS  MOVED  IN  OPPOSITE  DIRECTIONS.  S 
WOULD  EQUAL  -N(N-il/2.  IT  wOUL 0 BE  HELPFUL  TO  HAVE  A STATISTIC  WHICH 
GAVE  THE  STRENGTH  CF  ASSOCIATION  BETWEEN  THE  TWO  VARIABLES  CN  A SCALE 
FROM  -I  lEVERY  INCREASE  IN  ONE  VARIABLE  IS  ACCOMPANIED  BY  A DECREASE 
IN  THE  OTHER  VARIABLE)  TC  (THE  VARIABLES  INCREASE  TOGETHER).  SUCH 
A STATISTIC  IS  TAO  WHERE  TAU  IS  THE  OBTAINED  VALUE  OF  S DIVIDED  BY  THE 
MAXIMUM  OF  S«  THE  MAXIMUM  OF  S IS  N(N-l)/2  SO 


TAU  

N(N-l)/2 


2S 

MN-l) 


the  significance  of  TAU  CAN  RE  EVALUATED  FOR  LARGE  SAMPLES  IN  THE 
following  way?  THE  LARGE  SAMPLE  APPROXIMATION  IS 


S* 


s 

SQRTtN(N-l) (2N*5)/18I 


FOR  LARGE  N.  THE  LARGE  SAMPLE  APPROXIMATION  FOLLOWS  THE  STANDARD 
NORMAL  DISTRIBUTION.  THE  LEVEL  CF  SIGNIFICANCF  PRINTED  IS  TWICE  THE 
UPPER  TAIL  PROBABILITY  CF  THE  ABSOLUTE  VALUE  Of  S. 

FOR  N GREATER  THAN  B.  THE  NORMAL  APPROXIMATION  IS  ADEQUATE.  FOR 
N LESS  THAN  OR  EQUAL  TO  B.  THE  READER  IS  REFERRED  TO  TABLE  A. 21  IN 
HOLLANDER  AND  WOLFE  11973). 
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REFERENCES? 

1.  HOLLANDERi  M.  AND  O.A.  WOLFE  (19731  NONPAIUNETRIC  STAllSTICAL 
NETHOOS*  WILEY*  NEW  YORK.  PP  185-193*  384-393. 

2.  NOETHER*  6.  (19711  INTRODUCTION  TO  STATISTICS?  A FRESH  ARRROACN. 
HOUGHTON-MIFFLIN  CO.*  BOSTON.  PP  155-162. 
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CHI-SQUARE 

I d 

SCHISQ  I1»I2»...,150 


PARAMETER  FIELD  (COLUMNS  8-791 
1 I 
12 

• • 

..  INPUT  VAPIAbLcS  (CCLUMN'SI 

• • 

I pO 


OUTPUT 

1.  A CHI-SUUARt  STATISTIC  IS  CALCULATfcO  FOR  A TWO-WAY 
CCNTINGfcNCY  TABLE  IN  WHICH  THE  INPUT  VARIA3LES  FCKM 

THE  Columns  and  the  cases  form  the  rows. 

2.  UEGREES  FREEDOM  AND  SIGNIFICANCE  LEVEL  OF  THE 
CHI-SJUARF  VALUt  AND  THE  CONTINGENCY  COEFFICIENT  ARE 
GIVEN. 


USAGE 

1.  A MAXIMUM  UF  50  VARIABLES  MAY  BE  USED. 

2.  THERE  IS  A LIMIT  CF  100  CASES. 

3.  THc  PRODUCT  OF  THE  NUMBER  CF  VARIABLES  TIMES  THE 
NUMBER  OF  CASES  MAY  NUT  EXCEED  lAOO. 


EXAMPLES 

CAROS  COMMENTS 

1 a 

SCHISQ  1»2.3  A CHI-SQLAR5  TEST  WILL  BE  CALCULATED 

USING  VARIABLE  If  2 AND  3 AS  COLUMNS 
AND  ALL  CASES  AS  ROWS. 

SCHISQ  ALL  ALL  VARIARIABLES  AND  CASES  WILL  BE 

USED  TO  PERFORM  A CHI-SQUARE  TEST, 


! 


c 


STATISTICAL  KUUUAF  S PEC  I F IC  A T I GN  CArtCS 


68 


THE  CHI-SQUAKE  STATISTICf  CHI-SQUARE  > SUHI (IOBS-EXPIf2l/EXPl» 
HAY  BE  USED  TC  TEST  A NUMBER  OF  HYPOTHESES.  THIS  PROGRAM  COMPUTES  A 
CHI-SQUARE  STATISTIC  TO  TEST  THE  HYPOTHESIS  CF  INDtPENOcNCfc  OF  THE  2 
variables  OF  CLASSIFICATION  IN  AN  RXC  CONTINGENCY  TABLEt  WHERE  R IS 
THE  NUMBER  JF  ROwSt  (DISCRETE  CATEGORIES  OF  ONE  VARlABLEIt  AND  C IS 
TH=  NUMBER  OF  COLUMNS,  (DISCRETE  CATEGORIES  OF  THE  OTHER  VARIABLE). 
THE  DATA  ARE  THE  NUMBERS  CF  OBJECTS  CR  SUBJECTS  FALLING  INTO  EACH  OF 
THE  RXC  CELLS. 

AN  RXC  CONTINGENCY  TABLE 
VARIABLE  A 

v********* ***««««*«***#*** «****««« 
*m**********************************0 


* * * * * « 

♦I  ♦ Nil  * N12  NIC  * Nl. 

V * « * «>  * * 

A **♦* * 

ti  * * * * * * 

I *2  * N2l  ♦ N22  ♦....•  N2C  ♦ N2. 

A « « * * « * 

R ♦♦♦* * 

L •• 

E*. 

♦ • • 0 0 ^ 0 

B ♦♦♦♦ 0 

0 0 0 0 0 0 

*R  0 NRl  * NR2  «...  . * NRC  * NR. 

0 0 0 0 0 0 

0000000000000000000000000000000000000 

N.l  N.2  N.C  N.. 


IF  THE  TWO  VARIABLES  OF  CLASSIFICATION  ARE  INDEPENDENT,  THE 
PROBABILITY  OF  AN  OBSERVATION  FALLING  IN  THE  IJTH  CELL,  (I>l,...R, 
J«l,...,C),  WILL  BE  EQUAL  TO  THE  PROBABILITY  OF  THE  OBSERVATION 
FALLING  IN  THE  ITH  ROW  TIMES  THE  PROBABILITY  CF  THE  CBSERVATION 
FALLING  IN  THE  JTH  COLUMN.  EXPECTED  NUMBERS  OF  OBSERVATIONS  IN  EACH 
CELL  ARE  THEN  CALCULATED  ASSUMING  THE  RCWS  ANC  COLUMNS  ARE 
INDEPENDENT.  SPECIFICALLY,  THIS  PROGRAM  CALCULATES  THE  EXPECTED 
NUMBERS  FOR  EACH  IJTH  CELL  ASi  EXPd J) =(NI .) (N. Jl/N*. , WHERE  Nl.  IS 
THE  TOTAL  NUMBER  OF  CbSERVAT ICJNS  IN  THE  ITH  ROW,  N.J  IS  THE  TOTAL 
NUMBER  rjF  OBSERVATIONS  IN  THE  JTH  COLUMN,  AND  N*.  IS  THE  TOTAL  NUMBER 
OF  OBSERVATIONS  IN  THE  EXPERIMENT.  IT  MAKES  SENSE  INTUITIVELY  THAT 
THE  LARGER  THE  DEVIATIONS  OF  THE  OBSERVED  AND  EXPECTED  VALUES,  THE 
LESS  LIKELY  IT  IS  THAT  THE  ROWS  ANC  COLUMNS  ARE  INDEPENDENT.  THE 
CHI-SQUARE  STATISTIC  PROVIDES  A METHOD  OF  OECIOING  WHAT  THE 
PROBABILITY  IS  THAT  THE  ROWS  AND  COLUMNS  ARE  INDEPENDENT,  BASED  ON  THE 
MAGNITUDE  OF  THE  DEVIATIONS  BETWEEN  THE  OBSERVED  AND  EXPECTED  NUMERS 
IN  EACH  CELL.  THE  FOLLOWING  STATISTIC  IS  COMPUTED. 

CHI-SQUARE-SUM( ( (OBS( IJ  )>EXP(IJ) I t2)/EXP( 1J)1, 


WHERE  SUM  INDICATES  SUMMING  OVER  I AND  J FOR  AND 

CHI-SQUARE  FOLLOWS  AN  APPROXIMATE  CHl-SUUARE  DISTRIBUTION  WITH 
fR-ll(C-ll  DEGREES  OF  FREEDOM.  THE  PROBABILITY  OF  FINDING  A GREATER 


c 
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CHI-S3UARE  value  BY  CHANCE  ALCNE  IS  GIVEN.  THIS  IS  THE  LOWEST 
SIGNIFICANCE  LEVEL  AT  wHiCH  THE  NULL  HYPOTHESIS  OF  INDEPENDENCE  NAY  BE 
REJECTED. 

FOR  2X2  TABLES,  (R=2,C=2),  THE  CHl-SQUARE  STATISTIC  IS  CALCULATED 
WITH  A CORRECTION  FUR  CONTINUITY  AND  WITHOUT  SUCH  CORRECTION.  THE 
CHI-SQUARE  CORRECTEP  FCk  CONTINUITY  IS  CALCULATED  AS 


CHI -SOU ARE 


[(ABS((N11MN22)-(NLZI  (N21I  I -N . . /2  It  2 IN.. 


(NI.MN2.)  (N.n  (N.2) 


WITH  1 DEGREE  OF  FREEOCH.  THE  CHI-SGUARE  STATISTIC  FOR  THE  2X2  TABLE 
WITHOUT  CORRECTION  FCR  CONTINUITY  IS  CALCULATED  AS  ABOVE  BUT  WITHOUT 
THE  TERM  N../2.  GENERALLY,  THE  CORRECTED  CHI-SQUARE  GIVES  A BETTER 
APPROXIMATION  TO  THE  CHI-SQUARE  01 S TRI BUTI ON.  HOt»EVtR,  SOME  PREFER  NOT 
TO  USc  IT, 

REFERENCE 


STEELE,  R.  G.  0.  AND  J.H.  TORRIE  (I960). 
OF  STATISTICS.  MCCRAW-HILL,  NEW  YCRK.  PP 


PRINCIPLES  ANO  PROCEDURES 

366-375. 
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LEAST  SQUARES  REGRESSION 

L 8 

SLSQR  llfI2f.tI50 


PARAMETER  FIELD  (CuLUMNS  8-79) 

11  - DEPENDENT  VARIABLES 

12  - 

• • 

..  INDENDENT  VARIABLES 

• • 

150  - 


OUTPUT 

FOR  EACH  CUHBIKATION  OF  THE  DEPENDENT  VARIABLE  WITH  AN 
INDEPENDENT  VARIABLE  - 


1.  INTERCEPT  AND  SLOPE  FOR  THE  LEAST  SQUARES  LINE  ARE  GIVEN. 

2.  T-STATISTIC  WITH  DEGREES  CF  FREEDOM  AND  SIGNIFICANCE  LEVEL 
ARE  COMPUTED  TO  TEST  THE  HYPOTHESIS  THAT  THE  SLOPE  IS  ZERO. 

i,  THE  NUMBER  CF  OBSERVATIONS  IS  GIVEN. 

4.  THE  STANDARD  ERROR  OF  THE  SLOPE  ESTIMATEt  THE  CORRELATION 
COEFFICIENT,  AND  THE  COEFFICIENT  OF  DETERMINATION  ARE  GIVEN. 

5.  THE  F-STATISTIC,  DEGREES  CF  FREEDOM,  AND  SIGNIFICANCE 
LEVEL  FOR  TESTING  THE  OVERALL  SIGNIFICANCE  CF  REGRESSION 
ARE  PRINTED,  AND 

6.  MEANS  AND  STANDARD  DEVIATIONS  ARE  PRINTED. 


USAGE 

1.  A MAXIMUM  OF  ONE  UEPENOENT  ANC  49  INDEPENDENT  VARIABLES 
ARE  ALLOWED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLE 

CARO  COMMENTS 

1 8 

ALSQR  1,3,6  TWO  LEAST  SQUARES  ANALYSES  WILL  BE 

PERFORMED-  (DEPENDENT  VARIABLE-1  WITH 
INDEPENDENT  VARIABLE-3)  AND  (DEPENDENT 
VARIABLE-1  WITH  INDEPENDENT  VARIABLE-6). 
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SIMPLE  LINEAR  REGRESSION  IS  A METHOD  OF  FITTING  A STRAIGHT  LINE 
TO  A COLLECTION  OF  POINTS  lYIDtXm).  THE  MODEL  IS 

Ym>  BO  *■  Bixm  * Em 

WHERE  YM)  AND  X(I>  ARE  THE  ITH  OBSERVATIONS  OF  Y ANO  X II>L>...»N)» 
BO  « THE  UNKNCWK  INTERCEPT  PARAMETERt 
B1  » THE  UNKNOWN  SLOPE  PARAMETERt  ANO 

Eli)  3 AN  UNKNOWN  INCREMENT  BY  WHICH  ANY  Y(I)  MAY  VARY  FROM 
THE  TRUE  LINE  Y=BO  ♦ BIX. 

LEAST  SQUARES  IS  A METHOD  CF  ESTIMATING  BO  ANO  B1  WITH  BO  ANO  B1 
(ESTIMATES  ARE  NOT  IN  BULOFACt)  SO  THAT 

SUM  (Emt2)»  SUM(IY(I)>BO-BIX(Illt21 

IS  A MINIMUM,  THAT  ISt  SC  THAT  THE  SUM  CF  THE  SCUAREO  DEVIATIONS  OF 
the  PREDICTED  YtS,  YHATdl*  BO  *■  BIX(I),  ANO  THE  OBSERVED  YtS  IS  A 
MINIMUM. 

THE  ESTIMATES  OF  BO  AND  B1  ARE 

SUM!  (X(  I l-MXM  Y(  I I-MY)  1 SUM!  (X(  I)  )(Y(  I))  ]-(  (SUMX(  I ) )(  SUMY(I)  ) J/N 

Bl  

SUM!  <X(  I l-MX)  t21  SUM(X(  nt2l-((SUMX(I)  If21/N 


ANO 

83  * MY  - BIMX 

WHERE  MY  AND  MX  ARE  THE  MEANS  OF  Y ANO  X»  ANO  SUM  INDICATES  SUMMING 
OVER  I FOR  I=clt...,N. 

ANOTHER  OBJECT  OF  L INcAR  REGRESSION  IS  TO  TEST  HYPOTHESES  ABOUT 
THE  RELATIONSHIP  BETWEEN  THE  2 VARIABLES  OF  INTEREST.  THE  MOST 
FREQUENTLY  STATED  NULL  HYPOTHESIS  IS  THAT  THERE  IS  NO  RELATIONSHIP 
BETWEEN  THE  VARIABLES:  HO?  B1=0  AGAINST  POSSIBLE  ALTERNAT I VESt  Bl*Ot 
B1>0,  BKO.  THE  TEST  STATISTIC  IS 

T a Bl/STANOARD  ERROR  OF  61. 

T HAS  N-2  DEGREES  OF  FREEDOM.  THE  COMPUTATIONAL  FORMULA  FOR  THE 
STANDARD  ERROR  OF  61  IS  RATHER  LENGTHY  AND  WILL  NOT  BE  PRESENTED  HERE. 
BUT  MAY  BE  FOUND  IN  THE  REFERENCE  GIVEN. 

ANOTHER  EQUIVALENT  TEST  IS  THE  F TEST  FCR  SIGNIFICANCE  OF 
REGRESSION.  THE  COMPUTATION  OF  THE  F-STATISTIC  IS  MOST  INTELLIGIBLE 
PRESENTED  IN  TABULAR  FORM.  THE  SOURCES  OF  VARIATION  IN  THE  REGRESSION 
MODEL  MAY  BE  PARTIONEO  AS  TOTAL  VARIATION*  VARIATION  DUE  TO  REGRESSION 
*■  VARIATION  ABOUT  REGRcSSION.  THE  VARIATION  ASSOCIATED  WITH  EACH 
SOURCE  ARE  DEFINED  AS  IN  THE  TABLE'  BELOW.  COMPUTATIONAL  FORMULA  OF 
SST  AND  SSR  MAY  BF  FOUND  IN  THE  REFERENCE  GIVEN. 
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SJ'JRLc  * OF*  SUMS  Or  SOUARcS 


* MEAN  SQUARES 


j TOTAL 

! 

DUE  TO 


*N-l*  SST=  SUMI(/(I)-MY)t2) 

* * 

* * SSR* 


* 

* 

* 


REGRESSION*  I * 61ISUHl(X(I)-MXI(YII)-MYm  * MSR*SSR 

♦ * ♦ 

ARTUT  • * * 

Rer.«FSSICN*N-2*  SSE=SUMHYm-YHAT)t2)  = SST-SSR*  MSE*SSE/N-2 

******«**•••**•*••«*«*•*•«**•**•*****«*••«••••••«•*«•«••♦•«•• 


THE  TEST  STATISTIC  IS  F«NSR/MSE  »«ITH  I AND  N-2  DEGREES  OF  FREEDOM, 

NOTE  THAT  Tt2«F.  THE  SIGNIFICANCE  LEVELS  OF  T ANO  F ARE  GIVEN. 

OTHER  QUANTITIES  CALCULATED  ARE  THE  CGBRELATICN  CCEFFICIENT  R, 

AND  THE  C'JEFFICIcNT  OF  DETERMINATION,  Rt2=SSR/SST.  Rt2  IS  A MEASURE 
OF  THE  PRuPORTIJN  CF  THE  TOTAL  VARIATION  ABCUT  THE  MEAN,  MY,  EXPLAINED 
8Y  THE  regression  OF  Y ON  X.  NATURALLY,  THE  LARGER  R»2  THE  BETTER  THc 
FIT  IS, 

REFERENCE 

DRAPER,  N.  R.  AND  H.  SMITH  (19661.  APPLIED  REGRESSION  ANALYSIS.  JOHN 
WILEY  ANO  SONS,  INC.,  NEW  YORK.  PP  1-34. 


C,  iTATISriCAL  .OUTINt  SPEC  I F 1C  AT  ION  CARCS  7i 

MULTIPLE  REGRESSION 

1 ti 

SNREG  Il»I2f*«125 

PARAMETER  FIELD  (COLUMNS  8-79) 

11  - DEPENDENT  VARIABLE 

12  - 

• • 

..  INDEPENDENT  VARIABLES 

• • 

125  - 


OUTPUT 

1.  THE  MEANS,  AND  STANDARD  DEVIATIONS  OF  ALL  VARIABLES 
ARE  PRINTED. 

2.  CORRELATIONS  BETWEEN  ALL  VARIABLES  ARE  GIVEN, 

3.  INTERCEPT,  REGRESSION  COEFFICIENTS,  STANDARD  ERRORS 
JF  REGRESSION  COcFF IC lENTS,  T-VALUES,  DEGREES  OF 
FREEDOM  AND  SIGNIFICANCE  LEVELS  OF  THE  T-VALUES  FOR 
TESTING  hypotheses  THAT  THE  REGRESSION  COEFFICIENTS 
ARE  ZERO  APE  INCLUDED. 

A,  MULTIPLE  CORRELATION  AND  OETERM INATION  ARE  GIVEN. 

5.  THE  STANDARD  ERROR  OF  THE  ESTIMATE  IS  PRINTED. 

b.  AN  ANALYSIS  OF  VARIANCE  FOR  THE  REGRESSION  IS  GIVEN. 
this  INCLUDES  THE  SUMS  OF  SQUARES  ATTRIBUTABLE  TO 
REGRESSION,  ABOUT  THE  REGRESSION,  AND  OF  THE  TOTAL. 
DEGREES  OF  FREEDOM,  MEAN  SQUARES,  THE  F-STATISTIC 
AND  ITS  LEVEL  OF  SIGNIFICANCE  ARE  PRINTED. 


USAGE 

1.  CNE  DEPENDENT  VARIABLE  MAY  BE  USED. 

2.  NC  MORE  THAN  2A  INDEPENDENT  VARIABLES  MAY  BE  USED. 

3.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 


EXAMPLES 
CAROS 
1 8 
SMREG  ALL 


$MRE6  3,1,2 


COMMENTS 

A MULTIPLE  REGRESSION  ANALYSIS 
USING  VAR.  I AS  THE  DEPENDENT 
variable  and  REMAINING  VARIABLES 
AS  THE  INDEPENDENT  VARIABLES 
MILL  BE  PERFORMED. 

A MULTIPLE  REGRESSION  ANALYSIS 
WILL  BE  PERFORMED  USING  VAR.  3 
AS  THE  DEPENDENT  VARIABLE  AND 
VARIABLES  I AND  2 AS  THE  INDEPENDENT 
VARIABLES. 
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I.M  multiple  REGRcSSION,  the  model  is 

vm<  BO  * Bl*xim  ♦ B2*X2(I)  4^  ...  * BK*XK(I)  * Em» 

WHERE  Yin  is  the  ITF  MEASUBcMENT  OF  THE  DEPENDENT  VARIABLE  Y, 

Xllllf  X2(I tt...tXK(I)  ARE  THE  ITH  MEASUREMENTS  OF  THE 
independent  VARIABLESt  XI,  X2,...,XK  (I-l,....N), 

BO,  Bl,  B2,...,BK  ARE  THE  UNKNOWN  COEFFICIENTS  ASSOCIATED  WITH 
THE  INTERCEPT  AND  THE  K INDEPENDENT  VARIABLES,  AND 
E(U  = ERROR  ASSOCIATED  WITH  THE  ITH  MEASUREMENT. 

ONE  OBJECT  OF  THIS  PROGRAM  IS  TC  ESTIMATE  BO,  B1,...,BK  SO  THAT  THE 
SUM(E(I)*2I  IS  A MINIMUM.  ANOTHER  OBJECT  IS  TO  DECIDE  IF  THE  OVERALL 
REGRESSION  OF  Y ON  XI,  X2,...,XK  IS  SIGNIFICANT  (I.E.,  ACCOUNTS  FOR 
OBSERVED  VARIATION  IN  Y).  IN  THIS  REGARD,  A TEST  IS  PROVIDED  OF  THE 
HYPOTHESIS  HO?  B1=‘B2  = ...=BM0,  AGAINST  THE  ALTERNATE  HYPOTHESIS,  NOT 
ALL  BI»0  FOR  1=1,. ..,K.  FURTHER  TESTS  ARE  PROVIDED  FOR  HYPOTHESES  OF 
THE  FORM  HO?  BI=0,  1=1,.. .,K. 

THE  FOLLOWING  STATISTICS  ARE  OUTPUT. 

1.  THE  MEANS  AND  STANDARD  DEVIATIONS  OF  Y AND  X1,...,XK  ARE  PRINTED. 

2.  THE  CORRELATION  MATRIX  OF  THE  DEPENDENT  AND  INDEPENDENT  VARIABLES 
IS  GIVEN.  THIS  GIVES  MEASURES  OF  ASSOCIATION  BETWEEN  THE  DEPENDENT 
VARIABLE  AND  EACH  INDEPENDENT  VARIABLE  AND  AMONG  THE  INDEPENDENT 
VARIABLES. 

3.  THE  ESTIMATES  OF  Blf.tBK  AND  THE  STANDARD  ERRORS  OF  THE 
ESTIMATES  ARE  PRINTED.  T STATISTICS  ARE  PRINTED  FOR  TESTING  EACH 
HYPOTHESIS  HO?  BI=0,  1=1,. ..,K. 

T=  BI2STANOARO  ERR0R(8I»  WITH  N-K  DEGREES  OF  FREEDOM. 

THE  LEVEL  OF  SIGNIFICANCE  IS  GIVEN  FOR  THE  TEST  OF  EACH  NULL 
HYPOTHESIS  AGAINST  THE  TWO-SIDED  ALTERNATIVE  BI*0.  FOR  TESTS  AGAINST 
ONE-SIDED  ALTERNATIVES,  BI>0  OR  BKO,  USE  THE  PRINTED  LEVEL  OF 
SIGNIFICANCE  DIVIDED  BY  2. 

4.  THE  ESTIMATE  OF  BO  IS  OUTPUT  AS  THE  INTERCEPT. 

5.  MULTIPLE  CORRELATION,  R,  IS  A MEASURE  OF  THE  COMBINED  EFFECT  OF 
THE  INDEPENDENT  VARIABLES  ON  THE  DEPENDENT  VARIABLES. 

6.  THE  COEFFICIENT  OF  MULTIPLE  DETERMINATI CN,  R*2,  IS  A MEASURE  OF 
THE  PROPORTION  OF  THE  TOTAL  VARIATION  ABOUT  THE  MEAN  OF  Y EXPLAINED  BY 
THE  REGRESSION  OF  Y CN  X1,...,XK.  IT  CAN  BE  THOUGHT  OF  AS  A MEASURE 
OF  THE  USEFULNESS  OF  THE  INDEPENDENT  VARIABLES  IN  ACCOUNTING  FOR 
VARIATION  IN  THE  Y(  IHS. 

7.  THE  STANDARD  ERROR  OF  THE  ESTIMATE  IS  AN  ESTIMATE  OF  THE  PRECISION 
OF  THE  REGRESSION.  IT  IS  EQUAL  TO  THE  SQRTIMEAN  SQUARE  ABOUT  THE 
REGRESSION! . 

8.  AN  ANALYSIS  OF  VARIANCE  TABLE  FCR  THE  OVERALL  REGRESSION  IS 
INCLUDED.  THE  SOURCES  CF  VARIATION  AND  ASSOCIATED  DEGREES  CF  FREEDOM, 
SUMS  OF  SQUARES  AND  MEAN  SQUARES  ARE  GIVEN.  AN  F-STATISTIC  IS 
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CALCULATED  AS  THE  RATIO  UF  THE  MEAN  SQUARE  DUE  TC  REGRESSION/MEAN 
SQUARE  ABOUT  THE  REGRESSION.  THE  DEGREES  OF  FREEDOM  OF  THE  CALCULATED 
F ARE  K.N-K  (KsNO.  OF  PARAMETERS*  N>  NO.  OF  OBSERVATIONS).  THE 
F-STATISTIC  MAY  BE  USED  TO  TEST  THE  HYPOTHESIS,  HOT 
AGAINST  THE  ALTERNATIVE,  NOT  ALL  BI«0,  I«1,...,K.  THE  LEVEL  OF 
SIGNIFICANCE  IS  THE  PRCdABILITY  OF  OBTAINING  A LARGER  F WITH  K,N-K 
O.F.  3Y  CHANCE  ALONE. 

REFERENCE 

DRAPER,  N.  R.  AND  H.  SMITH  (1966).  APPLIED  REGRESSION  ANALYSIS.  JOHN 
WILFY  AND  SONS.  INC.,  NEW  YORK.  PP  44-85. 
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STEP-WISE  MULTIPLE  REGRESSION 

1 8 

SSWREG  I1«I2»...,I25 


PARAMETER  FIELD  iCCLUMNS  6-7S) 

11  - DEPENDENT  VARIAriL? 

12 

• • 

..  - INDEPENDENT  VARIAttLES 

• • 

125 


OUTPUT 

THE  OUTPUT  INCLUDES 

1.  MEANS  AND  STANOAhO  OEVIATTUNS  OF  ALL  VARIABLES: 

2.  correlation  maktix; 

3.  SUM  OF  SQUARES  RFCUCEO  IK  EACH  STEP; 

<».  CUMULATIVE  SUM  CF  SQUARES  RPCUCED*. 

5.  MUTIPLE  CORRELATICN  COEFFICIENTS  FOR  EACH  STEP; 

6.  F-VALUEt  DEGREES  OF  FREEDCM,  AND  SIGNIFICANCE  LEVEL  CF 
F-VALUE  FOR  TESTING  SIGNIFICANCE  OF  OVERALL  REGRESSION 
AT  EACH  STEP; 

7.  STANDARD  ERROR  OF  ESTIMATE,  AND 

8.  AT  EACH  STEP  FOR  EACH  INDEPENDENT  VARIABLE  ENTERED, 
REGRESSION  COEFFICIENTS,  STANDARD  ERROR  OF  REGRESSION 
COEFFICIENTS,  AhD  T-VALUES  KITH  DEGREES  OF  FREEDOM  AND 
SIGNIFICANCE  LEVEL  FOR  TESTING  HYPOTHESES  THAT  REGRESSION 
COEFFICIENTS  ARE  ZERO. 


USAGE 

1.  ONE  DEPENDENT  VARIABLE  MAY  8c  USED. 

2.  NO  MORE  THAN  2A  INDEPENDENT  VARIABLES  MAY  Be  USED. 

3.  THERE  IS  NC  LIMIT  ON  THE  NUMBER  CF  OBSERVATIONS. 


EXAMPLES 

CAROS  COMMENTS 

1 8 

iSWREG  ALL  STEPHISE  REGRESSION  GILL  BE  PERFORMED 

WITH  VAR.  I AS  THE  DEPENDENT  VARIABLE, 
AND  REMAINING  VARIABLES  AS  INDEPENDENT 
VARIABLES. 

SSWREG  4, 3, 2,1  STEPWISE  REGRESSION  WILL  BE  PERFORMED 

WITH  VAR.  4 AS  THE  DEPENDENT  VARIABLE, 
AND  variables  3,  2 AND  1 AS  THE 

INDc pendent  variables. 
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STEP'HISE  MULTIPLE  REGRESSION  IS  A RETHCD  GF  OECIOING  WHICH 
INJEPENOENT  variables  are  IMPORTANT  IN  TERMS  CF  ACCOUNTING  FOR 
VAJIATION  IN  THE  DEPENDENT  VARIABLE  Y.  INDEPENDENT  VARIABLES  ARE 
ENTERED  INTO  THE  REGRESSION  SEQUENTIALLY,  AND  AS  EACH  VARIABLE  IS 
ENTERED,  A TEST  MAY  oE  MADE  TO  DECIDE  IF  THE  RESIDUAL  SUM  OF  SQUARES 
HuS  BEEN  REDUCED  SIGNIFICANTLY. 

IN  THE  FIRST  STEP,  THE  INDEPENDENT  VARIABLE  WHICH  IS  MOST  HIGHLY 
CORRELATED  WITH  THE  DEPENDENT  VARIABLE  IS  ENTERED  INTO  THE  REGRESSION 
EQUATION.  OUTPUT  INCLUOES  SUM  CF  SCLARES  RECUCEC  BY  INCLUSION  CF  THIS 
iNDEPcNOENT  VARIABLE  ANC  THE  PROPORTION  OF  THE  TOTAL  REDUCED.  AN 
F-TEST  is  performed  on  this  PEGRESSICN  AND  THE  SIGNIFICANCE  LEVEL  OF 
THii  CALCULATED  F-STATISTIC  IS  PRIfUtD.  A DECISION  MAY  THEN  BE  MADE  AS 
T»  WHETHER  THc  REGRESSION  OF  Y ON  THE  INDEPENDENT  VARIABLE  HOST  HIGHLY 
C )RR  = LATED  WITH  Y IS  SIGNIFICANT.  IF  IT  IS  CECICEO  THAT  THIS 
REGRESSION  IS  SIGNIFICANT,  PROCEED  TO  STEP  2. 

IN  THE  SECOND  STEP,  THE  INDEPENDENT  VARIABLE  WHICH  HAS  THE 
HIGHEST  PARTIAL  CJKRtLATIJN  WITH  Y (ALLOWING  FOR  THE  INDEPENDENT 
VARIABLE  ENTERED  IN  THE  FIRST  STEP)  IS  ENTERED  INTO  THE  REGRESSION 
El'JATION.  A DECISION  MAY  HE  MADE  WHETHER  THE  NEWLY  ENTERED  VARIABLE 
MAKES  A SIGNIFICANT  REDUCTION  IN  THE  RESIDUAL  SU»*  OF  SQUARES  BY 
REFERRING  TO  THE  SIGNIFICANCE  LEVEL  CF  THE  COMPUTED  T-VALUE  FOR  THE 
REGRESSSION  COEFFICIENT  CF  THE  NEWLY  ENTERED  VARIABLE.  IF  IT  IS 
DECIDED  THAT  THE  NEWLY  ENTtRED  VARIABLE  MAKES  A SIGNIFICANT 
CONTRIBUTION  TO  REDUCING  THE  RESIDUAL  SUM  OF  SQUARES,  PROCEED  TC  STEP 
i. 

IN  STEP  i AND  SUBSEQUENT  STEPS,  THE  INDEPENDENT  VARIABLE  WITH  THE 
HIGHEST  PARTIAL  CORRELATION  WITH  THt  DEPENDENT  VARIABLE  (ALLCWING  FOR 
PREVIOUSLY  ENTERED  iNOEPENOcNT  VARIABLES)  IS  ENTERED  INTO  THE 
REGRESSION  EGUATICN.  A DECISION  MAY  BE  MADE  WHETHER  THE  NEWLY  ENTERED 
VARIABLE  MAKES  A SIGNIFICANT  REOLCTION  IN  THE  RESIDUAL  SUM  CF  SQUARES 
OVER  THE  REDUCTION  oROCUCtO  BY  VARIABLES  ALREADY  IN  THE  EQUATION  BY 
REFERRING  TO  THE  SIGNIFICANCE  LEVEL  CF  THE  COMPUTED  T-VALUE  OF  THE 
NEWLY  ENTERED  VARIABLE.  WHEN  IT  IS  DECIDED  THAT  A NEWLY  ENTERED 
variable  does  NUT  TAKE  UP  A SIGNIFICANT  AMOUNT  OF  VARIATION  IN  Y, 
BEYOND  THAT  ACCOUNTED  FuR  BY  PREVIOUSLY  ENTERED  VARIABLES,  THE  NEWLY 
cNTcHEO  VARIABLE  AND  ALL  VARIABLES  NOT  ALREADY  IN  THE  KEGRESSION 
EQUATION  ARE  NOT  INCLUJcD  IN  THE  REGRESSION  EQUATION. 

THE  REGRESSION  COEFFICIENTS  AND  F-STATISTICS  FOR  THE  REGRESSION 
EQUATION  WILL  BE  THOSE  IN  THE  STEP  PRECEDING  THE  STEP  IN  WHICH  IT  IS 
OFCIDEO  NOT  TO  INCLUDE  A VARIABLE  IN  THE  EQUATION. 


B-FERfcNCES 

OMAMcM,  N.R.  AND  H.  SMITH  (1966)  APPLIED  REGRESSION  ANALYSIS.  JOHN 
wlL€Y  AND  SONS,  NEW  YORK.  PP  169-171. 
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POLYNOMIAL  REGRESSION 

1 8 

8POLR6  I1«I2»I3 


PARAMETER  FIELD  I COLUMNS  8-79) 

11  - DEPENDENT  VARIABLE 

12  - INDEPENDENT  VARIABLE 

13  - DEGREE  POLYNOMIAL  TO  BE  FITTED 


OUTPUT 

1.  THE  NUMBER  OF  CASES  IS  GIVEN* 

2.  THE  INTERCEPT  AND  REGRESSION  COEFFICIENTS  ARE  PRINTED. 

3.  AN  ANALYSIS  OF  VARIANCE  FOR  THE  REGRESSION  IS  COMPUTED. 


USAGE 

1.  ONLY  ONE  DEPENDENT  AND  ONE  INDEPENDENT  VARIABLE  MAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS. 

3.  A 5TH  DEGREE  POLYNOMIAL  OR  A LESSER  DEGREE  MAY  BE  FITTED. 
fI3  2 S). 


EXAMPLE 
CARDS 
1 8 
8POLRG  2.4.3 


1 


COMMENTS 

A 3RD  OEGREF  POLYNOMIAL  WILL  BE 
FITTED  USING  VAR.  2 AS  THE  DEPENDENT 
VARIABLE  AND  VAR.  4 AS  THE  INDEPENDENT 
VARIABLE. 
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IN  THIS  ROUTINE*  THE  METHOD  GF  LEAST  SQUARES  IS  USED  TO  ESTIMATE 
THE  BJ«S*  J*0....»5*  IN  ANY  OF  THE  FOLLOWING  FIRST  THROUGH  FIFTH 
DEGREE  POLYNOMIAL  MODELS? 


Ydl^SO  * B1  XI 1 1 * Ein 
YCIIsBO  * Bl  X(II  ♦ 62  IX<nt21 
Y(I)»BO  ♦ Bl  xm  ♦ B2  IX(Ilt21 
Y(U»BO  * Bl  X(l)  * B2  Ixmt21 
YCII>BO  * 61  xm  * BZ  (xmf21 
B5  ixmtsi  * Ed) 


* ECU 

+ 63  IXm«31 

* B3  ixmt31 

* B3  ixmt31 


♦ Ell) 

♦ 64  [Xint4l  ♦ Elll 

♦ B4  fxm«4i  ♦ 


WHERE 


xm  AND  YU)  ARE  OBSERVATIONS  OF  THE  INDEPENDENT  VARIABLE  X AND 

THE 

DEPENDENT  VARIABLE  Y RESPECTIVELY 


BJ*S  ARE  UNKNOWN  PARAMETERS  TO  BE  ESTIMATED  AND 
Ell)  IS  A RANDOM  ERROR. 

IT  IS  ASSUMED  THAT  THE  EIDfS  ARE  INDEPENDENT  AND  NORMALLY 
DISTRIBUTED  WITH  MEANS  ZERO  AND  COMMON  VARIANCE. 

ONLY  ONE  MODEL  MAY  BE  CHOSEN  WITH  ANY  BPOLBG  CARO.  IT  WILL  BE 
NOTED  THAT  IF  A POLYNOMIAL  OF  DEGREE  1 IS  CHOSEN*  BP0LR6  WILL  PERFORM 
THE  SAME  CALCULATION  AS  $LSQR. 


REFERENCE 

DRAPER*  N.R.  AND  H.  SMITH  11966)  APPLIED  RE6RESSI0N  ANALYSIS*  JOHN 
WILEY  AND  SONS*  NEW  YORK.  PP  129>130. 
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NORMALITY  TEST 


1 8 

SNORNT 


PARAMETER  FIELD  {COLUMNS  8-79) 

11 

12 


INPUT  VARIABLES 


ISO 


OUTPUT 

FOR  EACH  VARIABLE  SPECIFIED  THE  OUTPUT  INCLUDES 

1.  SUM,  MEAN,  STANDARD  DEVIATION,  STANDARD  ERROR  OF 
the  MEAN,  NUMBER  OF  CBSERVAT IONS,  MINIMUM,  MAXIMUM, 
AND  RANGE: 

2.  THE  SECOND  MOMENT  ABOUT  THE  MEAN,  AND  THE  3RD  AND 
4TH  MOMENTS  RELATIVE  TO  THE  2ND  MOMENT; 

3.  COEFFICIENT  OF  VARIATION; 

4.  GEARYtS  KURTOSIS  TEST  STATISTIC; 

5.  HISTOGRAM  WITH  OBSERVED  AND  EXPECTED  FREQUENCIES,  AND 

6.  CHI-SQUARE  TEST  FOR  GOODNESS  OF  FIT  TO  THE  NORMAL 
DISTRIBUTION, 


USAGE 

1.  A MAXIMUM  OF  50  VARIABLES  MAY  BE  USED. 

2.  THERE  IS  NO  LIMIT  ON  THE  NUMBER  OF  OBSERVATIONS, 


EXAMPLE 

CAROS  COMMENTS 

1 8 

SNORMT  1,6  NORMALITY  TESTS  WILL  BE  PERFORMED 

ON  VARIABLES  1 AND  6. 
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C« 


THIS  PROGRAM  CONTAINS  FOUR  TEST  STATISTICS  WHICH  MAY  BE  USED  TO 
DECIDE  IF  THE  OBSERVATIONS  OF  INTEREST  ARE  NORMALLY  DISTRIBUTED. 

1.  THE  RELATIVE  THIRD  MOMENT  ABOUT  THE  MEAN* 


SUMI  (Xm-XBARM31/N 

GUI  t 

I(SUMI(xm>XBAR)»21/N)  t(3/2)l 


IS  A MEASURE  OF  THE  SKEWNESS  OF  THE  DISTRIBUTION  OF  THE  OBSERVATIONS. 
FOR  A NORMAL  DISTRIBUTION  GUI  * 0.  TABLE  A. 6 IN  SNEDECOR  AND  COCHRAN 
(19671  MAY  BE  USED  FOR  DECIDING  IF  Gdl  DIFFERS  SIGNIFICANTLY  FROM  0. 

2.  THE  RELATIVE  FOURTH  MOMENT  ABOUT  THE  MEAN* 


SUM( IX( I)-XBAR»»4 J/N 

G(2I  * 

(SUMK  X(I  )-XdARI  t21/N)t2 


IS  A MEASURE  OF  THE  KURTOSIS  OF  THE  DISTRIBUTION  OF  THE  OBSERVATION. 
FOR  A NORMAL  DISTRIBUTION  G(2I  = 3.  WHEN  GI2)  IS  LESS  THAN  3»  THE 
DISTRIBUTION  IS  SAID  TO  BE  PLATYKURTIC.  FLATTER  THAN  THE  NORMAL  CURVE. 
WHEN  G(2)  IS  GREATER  THAN  3,  THE  DISTRIBUTION  IS  SAID  TO  BE 
LEPTOKURTICf  MORE  PEAKED  THAN  THE  NORMAL  CURVE.  TABLE  A. 6 IN  SNEDECOR 
AMO  COCHRAN  (19671  MAY  BE  USED  FGR  DECIDING  IF  G(2)  DIFFERS 
SIGNIFICANTLY  FROM  3. 

3.  AN  ALTERNATE  METHOD  FOP  TESTING  KURTOSIS  IS  GEARVtS  TEST 

STATISTIC* 


A 


[SUMI A8S(X( 1 )-XBAR ) J 1/N 
SURTKSUMI  (X(  U-XBAR)t2  n/Nl 


FOR  NORMAL  OISTR IBLT  IONS*  A =»  .7979.  TO  DECIDE  IF  THE  OBTAINED 
GEARYtS  KURTOSIS  STATISTIC  DIFFERS  SIGNIFICANTLY  FROM  .7979,  SEE  GEARY 
(1936). 
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4.  A CHI-SQUARE  GOODNESS  OF  FIT  STATISTIC  IS  USED  TO  OEClOE  IF 
THE  08SERVE0  OISTKISUTICN  FITS  A NORMAL  DISTRIBUTION.  THE  GENERAL 
FORMULA  IS 


CHI-SQUARE  • 


SUMUCBSERWED  - EXPECTEOItZI 
EXPECTED 


f 


WITH  K-3  DEGREES  OF  FREEDOM.  K IS  THE  NUMBER  OF  CLASSES  USED  IN 
CALCULATING  CHl-SQUAREt  OBSERVED  IS  THE  OBSERVED  FREQUENCY  OF  EACH 
CLASS.  AND  EXPECTED  IS  THE  EXPECTED  FREQUENCY  OF  EACH  CLASS  IF  THE 
DISTRIBUTION  OF  THE  OBSERVATIONS  IS  NORMAL. 


REFERENCES 

1.  SNEOECOR.  G.W.  AND  W.6.  COCHRAN  (1967)  STATISTICAL  METHODS*  IOWA 
STATE  UNIV.  PRESS,  AMES,  IOWA.  PP  84-90. 


2.  GEARY,  R.C.  (1936)  BIOMETRIKA*  VOL  28?  P 295 
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HISTOGRAM 

1 8 
SHSTGM  II 


PARAMETER  FIELD  (COLUMNS  8-79) 
II  - INPUT  VARIABLE 


OUTPUT 

L.  THE  MEAN.  STANDARD  DEVIATION,  NUMBER  OF  OBSERVATIONS, 
MINIMUM,  MAXIMUM  AND  RANGE  BEFORE  AND  AFTER  SCALING 
AND  THE  MEDIAN  BEFORE  SCALING  ARE  PRINTED. 

2.  THE  LENGTHS  AND  MID-POINTS  OF  THE  INTERVALS  ARE  GIVEN. 

3.  THE  HISTOGRAM  IS  PLOTTED  USING  20  INTERVALS. 


USAGE 

1.  ONLY  ONE  INPUT  VARIABLE  MAY  BE  USED. 

2.  THERE  IS  A LIMIT  OF  1000  CASES. 

3.  AUTO-SCALING  MILL  BE  USED  IF  THE  SSCALE  CARO  DOES  NOT 
PRECEDE  THE  SHSTGM  CARD. 

4.  REFER  TO  THE  $SCAie  DOCUMENTATION  FOR  MANUALLY 
SCALING  HISTOGRAMS. 


EXAMPLE 

CARO  COMMENT 

1 8 

SHSTGM  3 A HISTOGRAM  OF  VARIABLE  3 

MILL  BE  PLOTTED. 


I 
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PLOT  ROUTINE 

1 8 
mOT  11,12 


PARAMETER  FIELD  (COLUMNS  8-791 

11  - DEPENDENT  VARIABLE  (VERTICAL  AXIS) 

12  - INDEPENDENT  VARIABLE  (HCRIZCNTAL  AXIS) 


OUTPUT 

A PLOT  OF  VARIABLE  II  VERSUS  VARIABLE  12  IS  PRINTED. 


USAGE 

1.  ONLY  ONE  DEPENDENT  AND  ONE  INDEPENDENT  VARIABLE  MAY  BE  USEO- 

3.  OVERLAYING  POINTS  ARE  PLOTTED  WITH  A $ SYMBOL. 

4.  AUTO-SCALING  IS  USED  IF  THE  SSCALE  CARO  OCES  NOT  PRECEDE 
THE  8PLOT  CARD. 

5.  REFER  TO  THE  SSCALE  DOCUMENTATION  FOR  MANUAL  SCALING  OF 
THE  AXES. 


EXAMPLE 

CARO  COMMENT 

1 8 

SPLOT  2,4  DEPENDENT  VARIABLE  2 WILL  BE 
PLOTTED  AGAINST  INDEPENDENT 
VARIABLE  4. 
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PLOT  NORMAL 

1 8 

8PLOTN  I1»I2«..«»I50 


PARAMETER  FIELD  (COLUMNS  8-79) 

11  - DEPENDENT  VARIABLE  (VERTICAL  AXIS) 

12  - 

• • 

..  INDEPENDENT  VARIABLES  (HORIZONTAL  AXIS) 

• • 

150  - 


OUTPUT 

A PLOT  IS  MADE  OF  THE  DEPENDENT  VARIABLE  VERSUS  EACH  OF  THE 
INDEPENDENT  VARIABLES  USING  THE  SYMBOL  -A-  FOR  II  VERSUS  I2t 
THE  SYMBOL  -8-  FOR  II  VERSUS  IB»  ETC.t  THROUGH  THE 
COC  FORTRAN  CHARACTER  SET. 


USAGE 

1.  A MAXIMUM  CF  1 DEPENDENT  AND  49  INDEPENDENT  VARIABLES 
MAY  BE  USED. 

■7.  NO  MORE  THAN  2500  DATA  POINTS  (INCLUDING  MISSING  DATA) 
ARE  ALLOWED. 

3.  AUTO-SCALING  IS  USED  IF  THE  8SCALE  CARO  DOES  NOT  PRECEDE 
THE  tPLOTN  CARO. 

A.  REFER  TO  THE  8SCALE  DOCUMENTATION  FOR  MANUAL  SCALING 
JF  THE  AXES. 


EXAMPLES 

CAROS  COMMENTS 

1 8 

APLOTN  4»1»2  PLOT  OF  DEPENDENT  VARIABLE  4 

VERSUS  INDEPENDENT  VARIABLES  1 AND  2 
8PL0TN  ALL  PLOT  OF  THE  1ST  VARIABLE  (DEPENDENT) 

VERSUS  THE  REMAINING  VARIABLES  AS 
INDEPENDENT 


r 
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PLOT  INVERTED 

1 8 

tPLOTI  Iltl2t*«.*I50 


PARAMETER  FIELD  {COLUMNS  8-79) 

11  - INDEPENDENT  VARIABLE  (HORIZONTAL  AXIS) 

12  - 

..  DEPENDENT  VARIABLES  (VERTICAL  AXIS) 

• • 

150  - 


OUTPUT 

A PLOT  IS  MADE  OF  THE  INDEPENDENT  VARIABLE  VERSUS  EACH 
OF  THE  DEPENDENT  VARIABLES  USING  THE  SYMBOL  -A-  FOR 
II  VERSUS  I2t  the  SYMBOL  -B-  FOR  11  VERSUS  13,  ETC., 
THROUGH  THE  GOC  FORTRAN  CHARACTER  SET. 


USAGE 

1.  A MAXIMUM  OF  1 INDEPENDENT  VARIABLE  AND  49  DEPENDENT 
VARIABLES  MAY  BE  USED. 

2.  NO  MORE  THAN  2500  DATA  PCINTS  (INCLUDING  MISSING  DATA) 
ARE  ALLOWED. 

3.  AUTO- SCALING  IS  USED  IF  THE  SSCALE  CARO  DOES  NOT  PRECEDE 
THE  SPLOTl  CARD. 

4.  REFER  TO  THE  SSCALE  DOCUMENTATION  FOR  MANUAL  SCALING 
OF  THE  AXES. 

5.  THE  PLOT  SHOULD  BE  ROTATED  90  DEGREES  COUNTERCLOCKWISE 
FOR  PROPER  ALIGNMENT  OF  THE  AXES. 


EXAMPLES 
CAROS 
1 8 
SPLOTI  7,1,4 

SPLOTI  ALL 


COMMENTS 

PLOT  CF  INDEPENDENT  VARIABLE  7 
VERSUS  DEPENDENT  VARIABLES  1 AND  4 
PLOT  OF  THE  1ST  VARIABLE  (INDEPENDENT) 
VERSUS  THE  REMAINIG  VARIABLES  AS 
DEPENDENT 


c 
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F>0ISTRIBUT10N 

I 3 

$FOIST  I1,I2«13 


PARANETER  FIELD  (COLUMNS  8-791 

11  - VARIABLE  CONTAINING  THE  NUMERATOR  DEGREES  OF  FREEDOM  - R1 

12  - variable  CONTAINING  THE  DENOMINATOR  DEGREES  OF  FREEDOM  - R2 

13  - variable  containing  the  F-STATISTIC 


OUTPUT 

THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  -F-,  WITH 
Rl  AND  R2  DEGREES  OF  FREEDOM,  GREATER  THAN  THE  INPUT 
F-VALUE  IS  PRINTED  AS  THE  LEVEL  SIGNIFICANCE. 


EXAMPLE 

CARO  COMMENTS 

1 8 

SFOIST  3,7,9  RI  IS  IN  VARIABLE  3. 

R2  IS  IN  VARIABLE  7. 

THE  F-STATISTIC  IS  IN  VARIABLE  9. 


1 
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CHI-SQUARE  OISTRieUTION 

1 8 
8COIST  11«12 


PARAMETER  FIELD  I COLUMNS  8-79) 


11  - VARIABLE 

CONTAINING 

THE 

DEGREES  OF 

FREEDOM 

12  - VARIABLE 

CONTAINING 

THE 

CHI-SQUARE 

STATISTIC 

OUTPUT 

THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  CHl-SQUARE* 
WITH  THE  GIVEN  DEGREES  OF  FREEOGMt  GREATER  THAN  THE 
INPUT  CHI-SQUARE  IS  PRINTED  AS  THE  LEVEL  OF  SIGNIFICANCE. 


EXAMPLE 

CARO  COMMENT 

1 8 

8C01ST  4t6  THE  DEGREES  OF  FREEDOM  ARE  IN  VARIABLE  4. 

THE  CHI-SQUARE  STATISTIC  IS  IN  VARIABLE  6. 


c 


STATISTICAL  ROUTINE  SPECIFICATION  CAROS 


89 


T-OISTRIBUTION 

1 8 
8TDIST  IltI2 

PARAMETER  FIELD  (CCLUNNS  8-79) 

11  - VARIABLE  CONTAINING  THE  DEGREES  OF  FREEDOM 

12  - VARIABLE  CONTAINING  THE  T-STATISTIC 


OUTPUT 

TWICE  THE  PROBABILITY  OF  FINDING  A RANDOM  VALUE  OF  -T-, 
WITH  THE  GIVEN  DEGREES  OF  FREEDOM,  GREATER  THAN  THE 
ABSOLUTE  VALUE  OF  THE  INPUT  T VALUE  IS  PRINTED  AS  THE 
LEVEL  OF  SIGNIFICANCE. 


EXAMPLE 

CARD 

1 

8 

COMMENT 

STOIST 

1*2 

DEGREES 

OF 

FREEDOM  ARE 

IN  VARIABLE  1 

T-VALUE 

IS 

IN  VARIABLE 

2. 
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MILCOXONfS  RANK  SUM  AND  S16NE0  RANKS  OISTRIRUTIOMS 

I 8 

SRSDST  ll*I2tl3t 


PARAMETER  FIELD  (COLUMNS  6-T9I 

11  - VARIABLE  CONTAINING  THE  NUMBER  OF  OBSERVATIONS  -M- 

OF  THE  FIRST  VARIABLE  (XI 

12  - VARIABLE  CONTAINING  THE  NUMBER  OF  CBSERVATICNS  >N- 

OF  THE  SECOND  VARIABLE  (Yl 

13  - variable  CONTAINING  WiLCOXONtS  RANK  SUM  OR  SIGNED 

RANKS  STATISTIC 


OUTPUT 

LEVEL  OF  SIGNIFICANCE  OF  THE  INPUT  WiLCOXONtS  RANK  SUM 
OR  SIGNED  RANKS  STATISTIC  IS  PRINTED. 


USAGE 

1.  WHEN  INPUTTING  THE  WiLCOXONtS  SIGNED  RANKS  STATISTIC* 
SET  M*0  AND  N SHOULD  BE  THE  NUMBER  OF  DIFFERENCES. 

2.  THIS  ROUTINE  IS  LIMITED  TO  M*NS30. 


EXAMPLE 

CARO  COMMENT 

1 8 

SRSOST  3*4*6  M IS  IN  VARIABLE  3. 

N IS  IN  VARIABLE  4. 

THE  WILCOXON  STATISTIC  IS  IN  VARIABLE  6. 


AD-A035  754 


unclassified 


2<f2 

404035754 


LETTE^^MAN  ARMY  INST  OF  RESEARCH  SAN  FRANCISCO  CALIF  F/6  12/1 

GENERALIZED  RESEARCH  ANALYSIS  STATISTICAL  SYSTEM.  SECOND  EDITIO— FTC (U) 
NOV  76  H LAZARUS*  T MCCAA*  R TEPLICK 

LAIR-33  NL 


92 


I 


I 

f 

t 

\ 

I 

JOB  CARO 

FETCHPS  (GRASZ.6RAZZ. GRASS) 

GRAZZ. 

7/8/9 

STITLE  EXAMPLE  1 - HYPOTHETICAL  RESEARCH  STUDY 
SOATA  4,6.1 

SF0RMAT(F2.0,F5.C,F4.1,F4.0) 

20  10.0  989  123 
18  15.8  987  147 
35  8.8  991  109 
24  8.0  986  118 
. 41  12.5  994  131 

i 43  12.8  997  135 

SBSTAT  ALL 
SCORR  ALL 
SRNKCR  ALL 
$HREG  2. 1,3,4 
SEND 
6/7/B/9 


FIGURE  2 
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FIGURE  3 
COMPUTER  OUTPUT 
EXV1PLE  1 
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i GRAZZ. 

7/b/9 

ill TIE  EXAMPLE  2,  ANALYSIS  OF  HEART  RATE  DATA 
SCMT  EXAMPLE 2 
$CMT  VARIABLE  1 IS  HEIGHT 
!>CMT  VARIABLE  2 IS  WEIGHT 
>CMT  VARIABLE  3 IS  AGE 
5>CMT  variable  4 IS  HEART  RATE 
iOATA  4,8,1 
iF0RMAT(F2.0,lX,3F3.0) 

6b  IbO  22  70 

69  ICO  23  78 

70  169  25  79 

71  167  27  82 

72  170  29  83 

73  175  24  87 

74  180  23  89 

75  180  21  72 

sbstat  all 

5C0RR  ALL 

STITLE  (EXAMPLE  2)  HEART  RATE  IS  4,  WEIGHT  IS  2 
SLSQR  4,2 
SPLOT  4,2 

bTITLE  HEART  RATE  IS  4,  HEIGHT  IS  1 
SPOLRG  4,1,3 

STITLE  DEPENDENT-WEIGHT  INDEPENDENT-HEIGHT  AND  AGE 
SSWREG  2,1,3 
SPLOTN  2,1,3 
SEND 
6/778/9 


FIGURE  16 
DECK  SETUP 
EXAMPLE  2 
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UXAHTLE  2)HEART  KAU  IS  4.  UtIGIlT  IS  2 


— PLOT  RUOTINL — 
VAR.  4 VERSUS  VAR.  2 
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87. 1000.. .♦ 


8S.2000...« 
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81. 4000... ♦ 


79.SOOO...+ 
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71. 9000.. .♦ 


70.0000. 


150.0000 


156.0000 


153.0000 


159.0000 


162.0000 


1650000 


ICb.OOOO 


171.0000 


174.0000 


177, 


FIGURE  23 
CUKPUTER  OUTPUT 
EXAMPLE  2 
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FIGURE  25 
COMPUTER  OUTPUT 
EXAMPLE  2 


DEPENDENT  -WEIGHT  INDEPENDENT-HEIGHT  AND  AGE 


116 


COMPUTER  OUTPUT 
EXAMPLE  2 


u> 

fs* 

00 

CM 


VO 


<n  o 

ro 

o 


U. 

o 

o 

o 

• 

o 

UI  UJ 

in 

o»  o 

o>  o 

h-  X 

00 

^ o 

^ o 

X ^ 

00 

1—  CTi 

»—  o> 

VO 

z < 

• 

• • 

• • 

X > 

rsfc 

CM 

CM 

O 1 

LT> 

vr> 

O h- 

VO 

VO 

Ov 

00  o 

CM 

UJ 

o z 

00 

o 

o\ 

CO  ^ 

< 

• 

• 

• 

CO  p- 

CO 

z 

o 

vn 

> 

o • 

z 

• • 

• • 

o 

z 

r— 

< 

• • 

• • 

• 

• • 

< • 

z z 

CM 

« • 

• • 

• 

• • 

• 

O LU 

On. 

►— 

• • 

• • 

• 

• • 

z • 

z o 

CO 

X 

o.  • 

« • 

• 

• • 

o • 

z o 

vn 

o 

UI  • 

• • 

h- 

UI  • 

u.  • 

z 

• 

*-  • 

o • 

Z 

o • 

• 

• 

LU 

CM 

v/>  • 

UI  • 

UI 

z • 

UI  • 

• C5 

X 

Q. 

o • 

< ♦ 

X • 

o z 

1 

• 

l/>  UI 

X • 

u> 

^ • 

^ • 

I-  z 

1— 

• 

»-•  H- 

o o 

z • 

«c  • 

VO 

z 

• 

X U) 

UI  UI 

u» 

< > 

> UI 

UJ 

• 

h- 

X o 

u. 

• •— 

o 

• 

00 

X 

UI 

< 

u.  < 

z 

• 

Z H-l 

00  Q 

O 

u. 

X 

1- 

UJ 

• 

M X 

UI  UJ 

Q 

o 

o z 

Ll.  *-» 

z z 

00  to 

• 

I— 

OC  0£. 

UJ 

o 

O h- 

o z 

o 

UJ 

• 

o 

< 

z 

z 

to  u. 

to 

H- I »— « 

o o% 

o 

• 

• 

UI  z 

X z 

UJ 

o 

|aH 

UI  UI 

VO  o 

^ VO 

z 

• 

• 

O 

OrO 

h- 

to  X 

o 

to  »-i 

o>  00 

• 

• 

X 

oo  •-< 

Z 

>-  Q 

z u. 

z z 

• • 

UI 

• 

o o 

h* 

UJ 

< 

^ s 

<o 

z z 

CO  CM 

H- 

_J 

• 

UJ  UI 

u.  oc 

-U 

< UI 

o 

C9  Z 

p* 

X 

CO 

o 

X o 

o o 

vn 

UJ 

Z UI 

N-I  z 

z o 

pv 

o 

< 

UI 

X 

o. 

UJ 

z 

<c  z 

u.  o 

z o 

1 

X 

</)  o 

xo 

z 

u. 

I-I  z 

UJ 

UI 

UI  UJ 

X X 

CO 

o 

z 

z z 

3 

< 

h- 

uo  O. 

< 

o 

o u. 

CJ  UI 

1 

> 

z 

< 

u.  o 

h- 

UI 

X z 

UJ  UI 

QC 

UJ 

to  o 

z 

z 

h- 

1— 

O'O 

> > 

< 

UI  to 

z 

■ z 

z 

z 

z 

UJ 

l/>  1-^ 

> 

z 

X UI 

u.  < 

CO  z 

o 

UJ 

UJ 

-J 

h— 

h-  h- 

•J  UI 

o o 

< CO 

^ z 

o 

o 

m 

U.  X 

<< 

< z 

z 

«-*  X 

z 

z 

z 

< 

o o 

□ ^ 

•J 

> CO 

-J  < 

Z X 

h- 

LU 

a. 

UI 

a. 

X X 

X 

1 UI 

UI  1— 

< X 

z 

o. 

UJ 

a. 

X o 

X X 

z 

X 

U.  Q 

>•  CO 

> 

P-4 

Ui 

Ui 

< 

X X 

X X 

o 

UI 

o 

</) 

o 

> 

oo  a. 

o o 

u. 

o 

o 


118 


U9 


OtPEHOtNT  -HtlCHT  IMOCPEHOtNT-Ht  IGU  AGE 

VARIABLE  2 VS  VARIABLES  1.  3. 

180.0000.. .B  B 


— PLOTN  ROUTINE' 


A A 


177.0000. . .♦ 

B 

174.0000. . .« 

171.0000. . .♦ 

168.0000. . .« 

165.0000. ..* 

162.0000. ..* 

*.  B 

159.0000...* 


156.0000...* 


A 


A 


A 


A 


A 


153.0000...* 


150.0000 


..;b * ♦ ♦ * * * * * 


21.0000 


26.4000 


31.8000 


42.6000 


37.2000 


48.0000 


53.4000 


64.2000 


58.8000 


69.6000 


75.0000 


FIGURE  29 
COMWTIR  OUTPUT 
EXAMPLE  2 
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GRAZZ. 

7/8/9 

$TITLE  EXAMPLE  3 - ANALYSIS  OF  ACTIVITY  OF  TEST  ANIMALS  ON  THREE  DIETS 

JCMT  EXAMPLE  3 

$CMT  VARIABLE  1 IS  DIET  1 

$CMT  VARIABLE  2 IS  DIET  2 

SCMT  VARIABLE  3 IS  DIET  3 

$DATA  3,8,1 

$F0RMAT(3F3.0) 

14  12  19 

15  16  17 

14  12  19 

17  13  17 

16  17  18 

18  14  19 

19  13  16 

15  16  17 
$ANOVA  ALL 
SNKMRT  ALL 
$KRUWC  ALL 
SEND 
6/7/8Z9 


FIGURE  30 
DECK  SETUP 
EXAMPLE  3 
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EXAMPLE  3 
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EXAMPLl  3 - ANALYSIS  OF  ACTIVITY  OF  TEST  ANIMALS  ON  THNEE  DIETS 


JOB  CARO 

FETCHPS  (GRASZ.GRAZZ, GRASS) 

GRAZZ. 

7/8/9 

STITLE  EXAMPLE  4,  TWO  STUDIES  INVOLVING  ACTIVITY  OF  SUBJECTS  ON  2 DIETS 
SCMT  EXAMPLE  4 

SCHT  VARIABLE  1 IS  CONTROL  DIET  STUDY  1 

>CMT  VARIABLE  2 IS  ENRICHED  DIET  STUDY  I 
SCMT  VARIABLE  3 IS  CONTROL  DIET  STUDY  2 
iCMT  VARIABLE  4 IS  ENRICHED  DIET  STUDY  2 
SDATA  4,9,1 
SF0RMAT(4F3.0) 

15  13  13  15 
17  19  15  19 
10  15  14  13 
19  17  10  20 

12  16  12  17 

13  15  17  16 

16  18  17  15 

18  18 
15 

SBSTAT  ALL 

STITLE  (EXAMPLE  4)  PAIRED  T-TEST  FOR  STUDY  1 DATA 
>PTST  1,2 

STITLE  (EXAMPLE  4)  WILCOXONfS  SIGNED  RANK  TEST  FOR  STUDY  1 DATA 
SSNRNK  1,2 

STITLE  (EXAMPLE  4)  NON-PAIRED  T-TEST  FOR  STUDY  2 DATA 
SNTST  3,4 

STITLE  (EXAMPLE  4)  WILCOXONfS  RANK  SUM  TEST  FOR  STUDY  2 DATA 

SRNKSM  3,4 

SEND 

6/7/8/9 


FIGURE  38 
DECK  SETUP 
EXAMPLE  4 


GENERALIZED  RESEARCH 


FIGURE  39 
COMPUTER  OUTPUT 
EXAMPLE  4 


EXAMPLE  4, TWO  STUDIES  INVOLVING  ACTIVITY  OF  SUBJECTS  ON  2 DIETS 


COMPUTER  OUTPUT 
EXAMPLE  4 


EXAMPLE  4.TW0  STUDIES  INVOLVING  ACTIVITY  OF  SUBJECTS  ON  2 DIETS 


EXAMPLE  4,TW0  STUDIES  INVOLVING  ACTIVITY  OF  SUBJECTS  ON  2 DIETS 
— BSTAT  ROUTINE — 


COMPUTER  OUTPUT 
EXAMPLE  4 


(EXAMPLE  4)PAIRED  T-TEST  FOR  STUDY  1 DATA 
— -PAIRED  T-TEST  ROUTINE- — 06  DEC  74 


FIGURE  43 
COMPUTER  OUTPUT 
EXAMPLE  4 


FIGURE  44 
COMPUTER  OUTPUT 
EXAMPLE  4 
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FIGURE  47 
COMPUTER  OUTPUT 
EXAMPLE  4 
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GRAZZ. 

7/8/9 

STITLE  EXAMPLE  5A.USING  STRAWS  FUNCTIONS  AND  ARITHMETIC  FUNCTIONS 

SDATA  2,4,1 

STRAWS  X(3)«SIN(X(1)} 

STRAWS  X(4)«ASIN(X(3)) 

STRAWS  X(5)«L06(X(2)) 

STRAWS  X(6)-10*(X(l)f2+X(5)) 

SF0RMAT(2F5.0) 

0 15 

.87  25 

1.239  30 

1.501  90 

STITLE  EXAMPLE  5B,USING  STRAWS  IF-THEN-ELSE  STATEMENTS 

SCMT  THE  FIRST  3STRANS  STATEMENTS  SHOW  HOW  TO  RECODE  BLANK  DATA 

SDATA  3,9,1 

STRAWS  IF  X(3)=0  THEN  IF  LOGICAL  (X(3))=l  THEN  X(4)=2 
STRAWS  ELSE  X(4)=X(3) 

STRAWS  ELSE  X(4)*X{3) 

STRAWS  IF  X(l)*2  THEN  X(5)=X(4)f2  ELSE  X(5)=X{2)+X(1 ) 
5F0RMAT(3F5.0) 

0 0 1 

1 1 

2 1 

3 2 4 

4 2 5 

5 3 6 

3 7 

4 8 

I 9 

STITLE  EXAMPLE  5C,  USING  STRAWS  TRANSPOSE  AND  SADD 

SDATA  6,2,1 

SFORMAT(6F2.0) 

1 2 3 

4 5 6 7 8 9 
STRAWS  TRANSPOSE 
SEXECU 

SADD  1,2,3 

SEND 

6/7/B/9 


FIGURE  48 
DECK  SETUP 
EXAMPLE  5 


FIGURE  49 
COMPUTER  OUTPUT 
EXAMPLE  5 
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FIGURE  51 
COMPUTER  OUTPUT 
EXAMPLE  5 
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EXAMPLE  5B.USING  STRAWS  IF-THEN-ELSE  STATEMENTS 


FIGURE  53 
COMPUTER  OUTPUT 
EXAMPLE  5 


EXAMPLE  5B.USING  STRANS  IF-THEN-ELSE  STATEMENTS 
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EXAMPLE  5C,  USING  $TRANS  TRANSPOSE  AND  $ADD 


FIGURE  55 
COMPUTER  OUTPUT 
EXAMPLE  5 


FIGURE  57 
COMPUTER  OUTPUT 
EXAMPLE  5 
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